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CORYNEBACTERIUM GLUTA MI CUM GENES ENCODING PROTEINS 
INVOLVED IN GENETIC STABILITY, GENE EXPRESSION, AND PROTEIN 

SECRETION AND FOLDING 



A hstract of the Disclosure 



Isolated nucleic acid molecules, designated SES nucleic acid molecules, which 
encode novel SES proteins from Corynebacterium glutamicum are described. The 
invention also provides antisense nucleic acid molecules, recombinant expression 
10 vectors containing SES nucleic acid molecules, and host cells into which the expression 
vectors have been introduced. The invention still further provides isolated SES proteins, 
mutated SES proteins, fusion proteins, antigenic peptides and methods for the 
improvement of production of a desired compound from C glutamicum based oo 
genetic engineering of SES genes in this organism. 
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CORYNEBACTERIUM GLUTAMTCUM GENES ENCODING PROTEINS 
INVOLVED IN GENETIC STABILITY, GENE EXPRESSION, AND PROTEIN 

SECRETION AND FOLDING 

5 Background of the Invention 

Certain products and by-products of naturally-occurring metabolic processes in 
cells have utility in a wide array of industries, including the food, feed, cosmetics, and 
pharmaceutical industries. These molecules, collectively termed 'fine chemicals*, 
include organic acids, both proteinogenic and non-proteinogenic amino acids, 

10 nucleotides and nucleosides, lipids and fatty acids, diols. carbohydrates, aromatic 
compounds, vitamins and cofactors, and enzymes. Their production is most 
conveniently performed through the large-scale culture of bacteria developed to produce 
and secrete large quantities of one or more desired molecules. One particularly useful 
organism for this purpose is Corynebacterium glutamicum, a gram positive, 

15 nonpathogenic bacterium. Through strain selection, a number of mutant strains have 
been developed which produce an array of desirable compounds. However, selection of 
strains improved for the production of a particular molecule is a time-consuming and 
difficult process. 

20 Summary of the Invention 

This invention provides novel nucleic acid molecules which may be used to 
identify or classify Coryne bacterium glutamicum or related species of bacteria. C 
glutamicum is a gram positive, aerobic bacterium which is commonly used in industry 
for the large-scale production of a variety of fine chemicals, and also for the degradation 

25 of hydrocarbons (such as in petroleum spills) and for the oxidation of terpenoids. The 
nucleic acid molecules therefore can be used to identify microorganisms which can be 
used to produce fine chemicals, e.g., by fermentation processes. While C glutamicum 
itself is nonpathogenic, it is related to other Corynehacterium species, such as 
Corynebacterium diphtheriae (the causative agent of diphtheria), which are important 

30 human pathogens. The ability to identify the presence of Corynebacterium species 
therefore also can have significant clinical relevance, e.g.. diagnostic apphcanons. 
Further, these nucleic acid molecules may serve as reference points for the mapping of 
the C glutamicum genome, or of genomes of related organisms. 

These novel nucleic acid molecules encode proteins, referred to herein as genetic 
35 stability, gene expression, or protein secretion/folding (SES) proteins. These SES 
proteins are capable of. for example, performing a function involved in the repair or 
recombination of DNA. transposition of genetic material, expression of genes (i.e.. 



involved in transcription or translation), protein folding, or protein secretion in 
Corynebacterium glutamicum. Given the availability of cloning vectors for use in 
Corynebacterium glutamicum, such as those disclosed in Sinskey el al., U.S. Patent No. 
4,649.1 1 9, and techniques for genetic manipulation of C. glutamicum and the related 
5 Brevihacterium species (e.g., lactofermentum) (Yoshihama et al. ./. Bacterid. 162: 591- 
597 (1985); Katsumata et al., ./. Bacteriol. 159: 306-31 1 (1984); and Santamaria et al., J. 
Gen. MicrohioL 130: 2237-2246 (1984)), the nucleic acid molecules of the invention 
may be utilized in the genetic engineering of this organism to make it a better or more 
efficient producer of one or more fine chemicals. This improved production or 
0 efficiency of production of a fine chemical may be due to a direct effect of manipulation 
of a gene of the invention, or it may be due to an indirect effect of such manipulation. 

There are a number of mechanisms by which the alteration of an SES protein of 
the invention may directly affect the yield production, and/or efficiency of production 
of a fine chemical from a C glutamicum strain incorporating such an altered protein. For 
5 example, modulation of proteins involved directly in transcription or translation (e.g.. 
polymerases or ribosomcs) such that they are increased in number or in activity should 
increase global cellular transcription or translation (or rates of these processes). This 
increased cellular gene expression should include those proteins involved in fine 
chemical biosynthesis, so an increase in yield, production, or efficiency of production of 
one or more desired compounds may occur. Modifications to the transcriptional/ 
translational protein machinery of C. glutamicum such that the regulation of these 
proteins is altered may also permit increased expression of genes involved in the 
production of fine chemicals. Modulation of the activity or number of proteins involved 
in polypeptide folding may permit an increase in the overall production of correctly 
folded molecules in the cell, thereby increasing the possibility that desired proteins (e.g.. 
fine chemical biosynthetic proteins) are able to function properly. Further, by mutating 
proteins involved in secretion from C glutamicum such that they are increased in 
number or activity, it may be possible to increase the secretion of a fine chemical (e.g., 
an enzyme) from cells in fermentor culture, where it may be readily recovered. 

Genetic modification of the SES molecules of the invention may also result in 
indirect modulation of production of one or more fine chemicals. For example, by 
increasing the number or activity of a DNA repair or recombination protein of the 
invention, one may increase the ability of the cell to detect and repair DNA damage. 
This should effectively increase the ability of the cell to maintain a mutated gene within 
its genome, thereby increasing the likelihood that a transgene engineered into C 
glutamicum (e.g., encoding a protein which will increase biosynthesis of a fine 
chemical) will not be lost during culture of the microorganism. Conversely, by 
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decreasing the number or activity of one or more DN A repair or recombination proteins, 
it may be possible to increase the genetic instability of the organism. Such 
manipulations should improve the ability of the organism to be modified by mutagenesis 
without the introduced mutation being corrected. The same holds true for proteins 

5 involved in transposition or rearrangement of genetic elements in C glutamicum (e.g.. 
transposons). By mutagenizing these proteins such that they arc either increased or 
decreased in number or activity, it is possible to simultaneously increase or decrease the 
eenetic stability of the microorganism. This has a profound impact on the ability of any 
other mutation to be introduced into C glutamicum. and on the ability of introduced 

1 0 mutations to be retained. Transposons also offer a convenient mechanism by which 
mutagenesis of C glutamicum may be performed: duplication of desired genes (e.g.. 
fine chemical biosynthetic genes) is readily accomplished by transposon mutagenesis, as 
is disruption of undesired genes (e.g., genes encoding proteins involved in degradation 

of desired fine chemicals). 
] 5 By modulating one or more proteins (e.g. sigma factors) involved in the 

regulation of transcription or translation in response to particular environmental 
conditions, it may be possible to prevent the cell from slowing or stopping protein 
synthesis under unfavorable environmental conditions, such as those found in large- 
scale fermentor culture. This should lead to increased gene expression, which in turn 
20 may permit increased biosynthesis of desired fine chemicals under such conditions. 
Mutagenesis of proteins involved in protein secretion systems may result in modulated 
secretion rates. Many such secreted proteins have functions critical for cell ^ability 
(e g cell surface proteases or receptors). An alteration of a secretory pathway such that 
these proteins are more readily transported to their extracellular location may improve 
25 the overall viability of the cell, and thus result in greater numbers of C. glutamtcvm cells 
capable of producing fine chemicals during large-scale culture. Further, the secretion 
apparatus (e.g.. the sec system) is also known to be involved in the insertion of integral 
membrane proteins (e.g., pores, channels, or transporters) into the membrane. Thus, the 
modulation of activity of proteins involved in protein secretion from C glutarmcum may 
30 affect the ability of the cell to excrete waste products or to import necessary metabolites. 
If the activity of these secretory proteins is increased, then the ability of the cell to 
produce fine chemicals may be similarly increased. If the activity of these secretory 
proteins is decreased, then there may be insufficient nutrients available to support 
overproduction of desired compounds, or waste products may interfere with such 

35 biosynthesis. , . , A 

The invention provides novel nucleic acid molecules which encode proteins, 
referred to herein as SES proteins, which are capable of. for example, participating m 
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the repair or recombination of DNA. transposition of genetic material, expression of 
genes (i.e., the processes of transcription or translation), protein folding, or protein 
secretion in Corynebacterium gtutamicum. Nucleic acid molecules encoding an SES 
protein are referred to herein as SES nucleic acid molecules. In a preferred 
embodiment, an SES protein participates in improving or decreasing genetic stability in 
C glutamicvm, in the expression of genes (i.e.. in transcription or translate or protan 
folding in this organism, or in protein secretion from C. glutamics. Examples of such 
proteins include those encoded by the genes set forth m Table 1 . 

Accordingly, one aspect of the invention pertains to isolated nucleic add 
molecules (e.g.. cDNAs) comprising a nucleotide sequence encoding an SES protem or 
biologically active portions thereof, as well as nucleic acid fragments suable as pnmers 
or hybridization probes for the detection or amplification of SES-cncodin 8 ; nucleic acid 
(e g DNA or mRNA). In particularly preferred embodiments, the isolated nucleic acid 
molecule comprises one of the nucleotide sequences set forth in Appendix A or the 
coding region or a complement thereof of one of these nucleotide sequences. In other 
partially preferred embodiments, the isolated nucleic acid molecule of the mvenhon 
comprises a nucleotide sequence which hybridizes to or is at least about 50%, preferably 
at least about 60%. more preferably at least about 70%, 80% or 90%. and even more 
preferably at least about 95%. 96%. 97<>/<, 98%, 99% or more homologous to a 
nucleotide sequence set forth in Appendix A. or a portion thereof. In other preferred 
embodiments, the isolated nucleic acid molecule encodes one of the ammo •«* 
sequences set forth in Appendix B. The preferred SES proteins of *e present mvention 
also preferably possess at least one of the SES activities described herein. 

in anomer embodiment, the isolated nucleic acid molecule encodes a protein or 
portion thereof wherein the protein or portion thereof includes ar i amino acid sequence 
which is sufficiently homologous to an amino acid sequence of Appendi, £ **. 
sufficiently homologous to an amino acid sequence of Appendix B such that the p^tein 
orportion thereof maintains an SES activity. Preferably, the protem or poriio *er~f 
encoded by the nucleic acid molecule maintains the ability to participate in the repa* or 
3 Combination of DNA, in the transposition of genetic material, in gene expression (« 
me esses of transcription or translation), in protein folding, or in protein secretion in 
^^,1. In one embodiment, the protein encoded by the nucleic 

XoUisafleastabo.^ 

at least about 70%, 80*. or 90% and most preferably at least \ 
5 98% or99%ormorehomologoustoanaminoacidsequenceof A ^ dlxB(e J; a " 
5 98 /o, or yy /o or m sequences set forth in Appendix B). In 

entire amino acid sequence selected from those sequcn ^ m which 

another preferred embodiment, the protein is a full length C. glut* P 
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is substantially homologous to an entire amino acid sequence of Appendix B (encoded 
by an open reading frame shown in Appendix A). 

In another preferred embodiment, the isolated nucleic acid molecule is derived 
from C glutamicum and encodes a protein (e.g., an SES fusion protein) which includes 
5 a biologically active domain which is at least about 50% or more homologous to one of 
the amino acid sequences of Appendix B and is able to participate in the repair or 
recombination of DNA, in the transposition of genetic material, in gene expression (i.e.. 
the processes of transcription or translation), in protein folding, or in protein secretion in 
Corynebacterium glutamicum. or has one or more of the activities set forth in Tabic 1 . 
1 0 and which also includes heterologous nucleic acid sequences encoding a heterologous 
polypeptide or regulatory regions. 

In another embodiment, the isolated nucleic acid molecule is at least 1 5 
nucleotides in length and hybridizes under stringent conditions to a nucleic acid 
molecule comprising a nucleotide sequence of Appendix A. Preferably, the isolated 
1 5 nucleic acid molecule corresponds to a naturally-occurring nucleic acid molecule. More 
preferably, the isolated nucleic acid encodes a naturally-occurring C glutamicum SES 
protein, or a biologically active portion thereof. 

Another aspect of the invention pertains to vectors, e.g.. recombinant expression 
vectors, containing the nucleic acid molecules of the invention, and host cells into which 
20 such vectors have been introduced. In one embodiment, such a host cell is used to 

produce an SES protein by culturing the host cell in a suitable medium. The SES protein 
can be then isolated from the medium or the host cell. 

Yet another aspect of the invention pertains to a genetically altered 
microorganism in which an SES gene has been introduced or altered. In one 
25 embodiment, the genome of the microorganism has been altered by introduction of a 

nucleic acid molecule of the invention encoding wild-type or mutated SES sequence as a 
transgene. In another embodiment, an endogenous SES gene within the genome of the 
microorganism has been altered, e.g.. functionally disrupted, by homologous 
recombination with an altered SES gene. In a preferred embodiment, the microorganism 
30 belongs to the genus Corynebacterium or Brevibactertum. with Corynebacterium 

glutamicum being particularly preferred. In a preferred embodiment, the microorganism 
is also utilized for the production of a desired compound, such as an amino acid, with 
lysine being particularly preferred. 

Still another aspect of the invention pertains to an isolated SES protein or a 
35 portion, e.g., a biologically active portion, thereof. In a preferred embodiment, the 

isolated SES protein or portion thereof can participate in the repair or recombination of 
DNA. in the transposition of genetic material, in gene expression (i.e., the processes of 
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protein secretion in Corymebacterium ghtamrcum. 

preferably at least about 70'/o. 80%. or 90%, and most preferably ^ 
1 « q«v 97% 98% or 99% or more homologous to an entire amino acid sequc 
15 96%. 97 /o. y 5 a>. or nrotein compri ses an amino acid 

, . . . ^ Kv a nucleotide sequence which hybridizes, e.g.. u 7 

which is encoded by a nucleoua q ^ferably at least about 60%, more 

stringent conditions, or is at least about 50 /o, ^ feab ** ^ about 

, «. -7HO/ ono/ ft or 90% and even more preteraDiy <u. 
25 preferably at least about 70%, 80 A. or 90 nucletrti de sequence of 

95%. 96%. 97%. 98.%. or 99% or more ^ '« ^° ' also have one 
Appendix B. It is also preferred that the preferred forms of SES p 

other preferred embodiments, this toon protetn ~ ' 0 „ 

recombination of DNA. in the transposition of genenc rn . 

stein secretion in 

*. processes ^^r^^,^ 

35 



Another aspect of the invention pertains to a method for producing a fine 
chemical. This method involves the culturing of a cell containing a vector directing the 
expression of an SES nucleic acid molecule of the invention, such that a fine chemical is 
produced. In a preferred embodiment, this method further includes the step of obtaining 
a cell containing such a vector, in which a cell is transfected with a vector directing the 
expression of an SES nucleic acid. In another preferred embodiment, this method 
further includes the step of recovering the fine chemical from the culture. Tn a 
particularly preferred embodiment the cell is from the genus Corynebacterium or 
Brevi bacterium, or is selected from those strains set forth in Table 3. 

Another aspect of the invention pertains to methods for modulating production of 
a molecule from a microorganism. Such methods include contacting the cell with an 
agent which modulates SES protein activity or SES nucleic acid expression such that a 
cell associated activity is altered relative to this same activity in the absence of the 
agent. In a preferred embodiment, the cell is modulated for one or more C glutamicum 
processes involved in genetic stability, gene expression, protein folding, or protein 
secretion such that the yield, production, or efficiency of production of a desired fine 
chemical by this microorganism is improved. The agent which modulates SES protein 
activity can be an agent which stimulates SES protein activity or SES nucleic acid 
expression. Examples of agents which stimulate SES protein activity or SES nucleic 
acid expression include small molecules, active SES proteins, and nucleic acids 
encoding SES proteins that have been introduced into the cell. Examples of agents 
which inhibit SES activity or expression include small molecules and antisense SES 

nucleic acid molecules. 

Another aspect of the invention pertains to methods for modulating yields of a 
desired compound from a cell, involving the introduction of a wild-type or mutant SES 
gene into a cell, either maintained on a separate plasmid or integrated into the genome of 
the host cell. If integrated into the genome, such integration can be random, or it can 
take place by homologous recombination such that the native gene is replaced by the 
introduced copy, causing the production of the desired compound from the cell to be 
modulated. In a preferred embodiment, said yields are increased. In another preferred 
embodiment, said chemical is a fine chemical. Tn a particularly preferred embodiment, 
said fine chemical is an amino acid. In especially preferred embodiments, said amino 
acid is L-lysine. 

Detailed Description of t he Invention 

The present invention provides SES nucleic acid and protein molecules which 
are involved in the repair or recombination of DN A, in the transposition of genetic 
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material, in gene expression (i.e. the processes of transcription or translation), in protein 
folding, or in protein secretion in Corynebacierium glutamicum. The molecules of the 
invention may be utilized in the modulation of production of fine chemicals from 
microorganisms, such as C glutamicum. either directly (e.g.. where overexpression or 

5 optimization of activity of a protein involved in secretion of a fine chemical (e.g.. an 
enzyme) has a direct impact on the yield, production, and/or efficiency of production of 
a fine chemical from the modified C. glutamicum). or an indirect impact which 
nonetheless results in an increase of yield, production, and/or efficiency of production of 
the desired compound (e.g., where modulation of the activity or number of copies of a 

10 C glutamicum DNA repair protein results in alterations in the ability of the 

microorganism to maintain the introduced mutation, which in turn may impact the 
production of one or more fine chemicals from such a strain). Aspects of the invention 
are further explicated below. 

15 I. Fine Chemicals 

The term 'fine chemical' is art-recognized and includes molecules produced by 
an organism which have applications in various industries, such as. but not limited to, 
the pharmaceutical, agriculture, and cosmetics industries. Such compounds include 
organic acids, such as tartaric acid, ttaconic acid, and diaminopimelic acid, both 

20 proteinogenic and non-proteinogenjc amino acids, purine and pyrimidinc bases, 

nucleosides, and nucleotides (as described e.g. in Kuninaka. A. (1996) Nucleotides and 
related compounds, p. 561-612, in Biotechnology vol. 6, Rehm et al.. eds. VCH: 
Weinheim, and references contained therein), lipids, both saturated and unsaturated fatty 
acids (e.g., arachidonic acid), diols (e.g.. propane diol. and butane diol). carbohydrates 

25 (e.g., hyaluronic acid and trehalose), aromatic compounds (e.g.. aromatic amines. 

vanillin, and indigo), vitamins and cofactors (as described in Ullmann's Encyclopedia of 
Industrial Chemistry, vol. A27, "Vitamins-', p. 443-613 (1996) VCH: Weinheim and 
references therein: and Ong, A.S., Niki. E. & Packer, L. (1995) "Nutrition. Lipids, 
Health, and Disease" Proceedings of the UNESCO/Confederation of Scientific and 

30 Technological Associations in Malaysia, and the Society for Free Radical Research - 
Asia, held Sept. 1-3, 1 994 at Penang. Malaysia, AOCS Press, (1995)). enzymes, and all 
other chemicals described in Gutcho (1983) Chemicals by Fermentation, Noyes Data 
Corporation. ISBN: 08 1 8805086 and references therein. The metabolism and uses of 
certain of these fine chemicals are further explicated below. 
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A. Amino Acid Metabolism and Uses 

Amino acids comprise the basic structural units of all proteins, and as such are 
essential for normal cellular functioning in all organisms. The term "amino acid" is art- 
recognized. The proteinogenic amino acids, of which there are 20 species, serve as 
structural units for proteins, in which they are linked by peptide bonds, while the 
nonproteinogenic amino acids (hundreds of which are known) are not normally found in 
proteins (see Ulmann s Encyclopedia of Industrial Chemistry, vol. A2, p. 57-97 VCH: 
Weinheim (1985)). Amino acids may be in the D- or L- optical configuration, though L- 
amino acids are generally the only type found in naturally-occurring proteins. 
Biosynthetic and degradative pathways of each of the 20 proteinogenic amino acids 
have been well characterized in both prokaryotic and eukaryotic cells (see. for example, 
Stryer. L. Biochemistry, 3* edition, pages 578-590 (1988)). The 'essential' amino acids 
(histidinc. isoleucine. leucine, lysine, methionine, phenylalanine, threonine, tryptophan, 
and valine), so named because they are generally a nutritional requirement due to the 
l5 complexity of their biosynthesis, are readily converted by simple biosynthetic pathways 
to the remaining 1 1 'nonessential' amino acids (alanine, arginine. asparagine. aspartate, 
cysteine, glutamate, glutamine, glycine, proline, serine, and tyrosine). Higher animals 
do retain the ability to synthesize some of these amino acids, but the essenttal ammo 
acids must be supplied from the diet in order for normal protein synthesis to occur. 
20 Aside from their function in protein biosynthesis, these amino acids are 

interesting chemicals in their own right, and many have been found to have various 
applications in the food, feed, chemical, cosmetics, agriculture, and pharmaceutical 
industries. Lysine is an important amino acid in the nutrition not only of humans, but 
also of monogastric animals such as poultry and swine. Glutamate is most commonly 
used as a flavor additive (mono-sodium glutamate. MSG) and is widely used throughout 
the food industry, as are aspartate, phenylalanine, glycine, and cysteine. Glyc.ne. L- 
methionine and tryptophan are all utilized in the pharmaceutical industry. Glutamme. 
valine, leucine, isoleucine, histidine. arginine, proline, serine and alanine are of use m 
both the pharmaceutical and cosmetics industries. Threonine, tryptophan, and D/ L- 
methionine are common feed additives. (Leuchtenberger. W. (1996) Amino aids - 
technical production and use. p. 466-502 in Rehm et al. (eds.) Biotechnology voK 6. 
chapter 14a. VCH: Weinheim). Additionally, these amino acids have been found to be 
useful as precursors for the synthesis of synthetic amino acids and proteins^ such as N- 
acetylcysteine. S^arboxymethyl-L-cysteine. (S>5-hydroxytryptopha^, and ^others 
35 described in Ulmann's Encyclopedia of Industrial Chemistry, vol. A2, p. 57-97, VCH- 
Weinheim, 1985. 
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The biosynthesis of these natural amino acids in organisms capable of 
producing them, such as bacteria, has been well characterized (for review of bacterial 
amino acid biosynthesis and regulation thereof, see Umbarger, H.E.(1978) Ann. Rev. 
Biochem. 47: 533-606). Glutamate is synthesized by the reductive amination of a- 
ketoglutarate. an intermediate in the citric acid cycle. Glutamine, proline, and arginine 
are each subsequently produced from glutamate. The biosynthesis of serine is a three- 
step process beginning with 3-phosphoglycerate (an intermediate in glycolysis), and 
resulting in this amino acid after oxidation, transamination, and hydrolysis steps. Both 
cysteine and glycine are produced from serine; the former by the condensation of 
homocysteine with serine, and the latter by the transferal of the side-chain P-carbon 
atom to tetrahydrofolate. in a reaction catalyzed by serine transhydroxymethylase. 
Phenylalanine, and tyrosine are synthesized from the glycolytic and pentose phosphate 
pathway precursors erythrose 4-phosphatc and phosphoenolpynivate in a 9-step 
biosynthetic pathway that differ only at the final two steps after synthesis of prephenate. 
Tryptophan is also produced from these two initial molecules, but its synthesis is an 1 1- 
step pathway. Tyrosine may also be synthesized from phenylalanine, in a reaction 
catalyzed by phenylalanine hydroxylase. Alanine, valine, and leucine are all 
biosynthetic products of pyruvate, the final product of glycolysis. Aspartate is formed 
from oxaloacetatc. an intermediate of the citric acid cycle. Asparagine. methionine, 
threonine, and lysine are each produced by the conversion of aspartate. Tsoleucine is 
formed from threonine. A complex 9-step pathway results in the production of histidine 
from 5-phosphoribosyI- 1 -pyrophosphate, an activated sugar. 

Amino acids in excess of the protein synthesis needs of the cell cannot be stored, 
and are instead degraded to provide intermediates for the major metabolic pathways of 
the cell (for review see Stryer. L. Biochemistry 3 rd ed. Ch. 21 "Amino Acid Degradation 
and the Urea Cycle" p. 495-516 (1988)). Although the cell is able to convert unwanted 
amino acids into useful metabolic intermediates, amino acid production is costly in 
terms of energy, precursor molecules, and the enzymes necessary to synthesize them. 
Thus it is not surprising that amino acid biosynthesis is regulated by feedback inhibition, 
in which the presence of a particular amino acid serves to slow or entirely stop its own 
production (for overview of feedback mechanisms in amino acid biosynthetic pathways, 
see Stryer, L. Biochemistry, 3 rd ed. Ch. 24: "Biosynthesis of Amino Acids and Heme * p. 
575-600 (1988))- Thus, the output of any particular amino acid is limited by the amount 
of that amino acid present in the cell. 



B. Vitamin. Co/actor, and Nutraceutical Metabolism and Uses 

Vitamins, cofactors. and nutraceuticals comprise another group of molecules 
which the higher animals have tost the ability to synthesize and so must ingest, although 
they are readily synthesized by other organisms such as bacteria. These molecules are 
either bioactive substances themselves, or are precursors of biologically active 
substances which may serve as electron carriers or intermediates in a variety of 
metabolic pathways. Aside from their nutritive value, these compounds also have 
significant industrial value as coloring agents, antioxidants, and catalysts or other 
processing aids. (For an overview of the structure, activity, and industrial applications 
of these compounds, see. for example. Ullman's Encyclopedia of Industrial Chemistry. 
"Vitamins" voJ. A27. p. 443-613. VCH: Wcinheim. 1996.) The term "vitamin" is art- 
recoenized. and includes nutrients which are required by an organism for normal 
functioning, but which that organism cannot synthesize by itself. The group of vitamins 
may encompass cofactors and nutraceutical compounds. The language "cofactor" 
includes nonproteinaccous compounds required for a normal enrymatic activity to 
occur Such compounds may be organic or inorganic; the cofactor molecules of the 
invention are preferably organic. The term "nutraceutical" includes dietary supplements 
having health benefits in plants and animals, particularly humans. Examples of such 
molecules are vitamins, antioxidants, and also certain lipids (e.g.. polyunsaturated fatty 
acids). 

The biosynthesis of these molecules in organisms capable of producing them, 
such as bacteria, has been largely characterized (Ullman's Encyclopedia of Industrial 
Chemistry, "Vitamins" vol. A27. p. 443-613. V C H: Wcinheim, 1996; Michal. G. (1999) 
Biochemical Pathways: An Atlas of Biochemistry and Molecular Biology, John Wiley 
& Sons; Ong. A.S., Niki. E- & Packer, L- (1995) "Nutrition, Lipids, Health, and 
Disease" Proceedings of the UNESCO/Confederation of Scientific and Technological 
Associations in Malaysia, and the Society for Free Radical Research - Asia, held Sept. 
1-3. 1994 at Penang, Malaysia. AOCS Press: Champaign. IL X. 374 S). 

Thiamin (vitamin B,) is produced by the chemical coupling of pyrimidine and 
thiazole moieties. Riboflavin (vitamin B 2 ) is synthesized from guanosine-5*-triphosphate 
(GTP) and ribose-S" -phosphate. Riboflavin, in turn, is utilized for the synthesis of flavin 
mononucleotide (FMN) and flavin adenine dinucleotide (FAD). The family of 
compounds collectively termed 'vitamin B*' (e.g., pyridoxins pyridoxamine, pyndoxa- 
S'-phosphate, and the commercially used pyridoxin hydrochloride) are all derivatives of 
the common structural unit, 5-hydroxy-6-methylpyridine. Pantothenate (pantothenic 
acid, (R)-(+)-N-(2.4-dirrydroxyo,3-dimemyl-l-oxobutyl>Manine) can be produced 
either by chemical synthesis or by fermentation. The final steps in pantothenate 
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biosynthesis consist of the ATP-driven condensation of (3-alanine and pantoic acid. The 
enzymes responsible for the biosynthesis steps for the conversion to pantoic acid, to &- 
alanine and for the condensation to panthotenic acid are known. The metabolically 
active form of pantothenate is Coenzyme A, for which the biosynthesis proceeds in 5 
5 enzymatic steps. Pantothenate, pyridoxal-5' -phosphate, cysteine and ATP are the 
precursors of Coenzyme A. These enzymes not only catalyze the formation of 
panthothante. but also the production of (RVpamoic acid, (R)-pantolacton, (R> 
panthenol (provitamin B 5 ), pantetheine (and its derivatives) and coenzyme A. 
Biotin biosynthesis from the precursor molecule pimeloyl-CoA in 
10 microorganisms has been studied in detail and several of the genes involved have been 
identified. Many of the corresponding proteins have been found to also be involved in 
Fe-cluster synthesis and are members of the nifS ctass of proteins. Lipoic acid is 
derived from octanoic acid, and serves as a coenzyme in energy metabolism, where it 
becomes part of the pyruvate dehydrogenase complex and the o-ketoglutarate 
15 dehydrogenase complex. The folates are a group of substances which are all derivatives 
of folic acid, which is turn is derived from L-glutamic acid, p-amino-benzoic acid and 6- 
methylpterin. The biosynthesis of folic acid and its derivatives, starting from the 
metabolism intermediates guanosine-5--triphosphate (GTP), L-glutamic acid and p- 
amino-benzoic acid has been studied in detail in certain microorganisms. 
20 Corrinoids (such as the cobalamines and particularly vitamin B , 2 ) and 

porphyrias belong to a group of chemicals characterized by a tetrapyrole ring system. 
The biosynthesis of vitamin B I2 is sufficiently complex that it has not yet been 
completely characterized, but many of the enzymes and substrates involved are now 
known. Nicotinic acid (nicotinate). and nicotinamide are pyridine derivatives which are 
25 also termed Win. Niacin is the precursor of the important coenzymes NAD 

(nicotinamide adenine dinucleotide) and NADP (nicotinamide adenine dinucleotide 
phosphate) and their reduced forms. 

The large-scale production of these compounds has largely relied on cell-free 
chemical syntheses, though some of these chemicals have also been produced by large- 
30 scale culture of microorganisms, such as riboflavin. Vitamin B 6 . pantothenate, and 

biotin. Only Vitamin B, 2 is produced solely by fermentation, due to the complexity of 
its synthesis. In vitro methodologies require significant inputs of materials and time, 
often at great cost. 

35 C Purine. Pvrimidine. Nucleoside and Nucleotide Metabolism and Uses 

Purine and pyrimidine metabolism genes and their corresponding proteins are 
important targets for the therapy of tumor diseases and viral infections. The language 
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"purine" or "pyrimidine" includes the nitrogenous bases which are constituents of 
nucleic acids, co-enzymes, and nucleotides. The term "nucleotide" includes the banc 
structural units of nucleic acid molecules, which are comprised of a nitrogenous base, a 
pentose sugar (in the case of RNA, the sugar is ribose; in the case of DNA, the sugar is 
5 D-deoxyribose), and phosphoric acid. The language "nucleoside" includes molecules 
which serve as precursors to nucleotides, but which are lacking the phosphoric acid 
moiety that nucleotides possess. By inhibiting the biosynthesis of these molecules, or 
their mobilization to form nucleic acid molecules, it is possible to inhibit RNA and DNA 
synthesis; by inhibiting this activity in a fashion targeted to cancerous cells, the ability 
10 of tumor cells to divide and replicate may be inhibited. Additionally, there are 

nucleotides which do not form nucleic acid molecules, but rather serve as energy stores 
(i e.. AMP) or as coenzymes (i.e., FAD and NAD). 

Several publications have described the use of these chemicals for these medical 
indications, by influencing purine and/or pyrimidine metabolism (e.g. Chnstopherson. 
15 R I and Lyons. S.D. (1990) "Potent inhibitors of de n<r>o pyrimidine and punne 

biosynthesis as chemotherapeutic agents." Med. Res. Review 10: 505-548). Studies of 
enzymes involved in purine and pyrimidine metabolism have been focused on the 
development of new drugs which can be used, for example, as immunosuppressants or 
anti-proliferants (Smith, J.L.. ( 1 995) "Enzymes in nucleotide synthesis." Curr. Opt*. 
Struct Biol. 5: 752-757: (1995) Biochem Sac. Transact. 21: 877-902). However, purine 
and pyrimidine bases, nucleosides and nucleotides have other utilities: as intermediates 
in the biosynthesis of several fine chemicals (e.g.. thiamine, S-adenosyl-meth.onme. 
folates, or riboflavin), as energy carriers for the cell (e.g.. ATP or GTP), and for 
chemicals themselves, commonly used as flavor enhancers (e.g.. IMP or GMP) or for 
several medicinal applications (see. for example, Kuninaka, A. ( 1 996) Nucleotides and 
Related Compounds in Biotechnology vol. 6. Rehm et al.. eds. VCH: Wemheun. p. 561- 
612) Also, enzymes involved in purine, pyrimiditie, nucleoside, or nucleotide 
metabolism are increasingly serving as targets against which chemicals for crop 
protection, including fungicides, herbicides and insecticides, are developed. 
30 The metabolism of these compounds in bacteria has been character.** (for 

reviews see. for example. Zalkin, H. and Dixon. J.E. (1992) -* novo purine nuc eohde 
biosynthesis", in: Progress in Nucleic Acid Research and Molecular Bidogy, voL 42, 
Academic Press:, p. 259-287; and Michal, G. (1999) "Nucleotides ^ u " s • 
Chapter 8 in: Biochemical Pathways: An Atlas of Biochemistry and Molecular Biology;, 
35 Willy: New York). Purine metabolism has been the subject of intensive ; research, and is 
essential to the normal functioning of the cell. Impaired purine metabolism in h.gher 
animals can cause severe disease, such as gout. Purine nucleotides are synthesized from 
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ribose-5-phosphate. in a scries of steps through the intermediate compound mosine-5 - 
phosphate (IMP), resulting in the production of guanosmeS -monophosphate (GMP) or 
adenosines-monophosphate (AMP), from which the triphosphate forms utilized as 
nucleotides are readily formed. These compounds are also utilized as energy stores, so 
their degradation provides energy for many different biochemical processes m the celL 
Pyridine biosynthesis proceeds by the formation of uridine-5 "-monophosphate (UMP) 
from ribose-5-phosphate. UMP. in turn, is converted to cytidine-5'-triphosphate (CTP). 
The deoxy- forms of all of these nucleotides are produced in a one step reduction 
reaction from the diphosphate ribose form of the nucleotide to the diphosphate 
deoxyribose form of the nucleotide. Upon phosphorylation, these molecules are able to 
participate in DNA synthesis. 



20 



D Trehalose Metabolism and Uses 

Trehalose consists of two glucose molecules, bound in a. a-1 ,1 linkage. It is 
15 commonly used in the food industry as a sweetener, an additive for dried or frozen 
foods, and in beverages. However, it also has applications in the pharmaceutical 
cosmetics and biotechnology industries (see. for example. Nishimoto et al ^(1998) -U.S. 
Patent No. 5.759.610; Singer, M-A- and Lindquist, S. (1998) Trends Biotech. 16: 460- 
467- Paiva. CL.A. and Panek, A.D. (1996) Biotech. Ann. Rev. 2: 293-314; and 
Shiosaka, M. (1997) J. Japan 1 72: 97-102). Trehalose is produced by enzymes from 
many microorganisms and is naturally released into the surrounding medium, from 
which it can be collected using methods known in the art. 

IT noetic Stability: P r ^in Synthesis and Protein Secretion in C. glutomicum 

^ production 0 f a desired compound from a cell such as C gluiamicum is the 

culmination of a large number of separate yet interrelated processes each of which is 
critical to the overall production and release of the compound from the cell. In 
engineering a cell to overproduce one or more fine ^ 
given to each of these processes to ensure that the biochemical machinery of ^ cell wUl 
be compatible with such genetic manipulation. "^ h ^.° fP " e „ the 
importance include the stability of the altered gene(s) upon tntroduction into the celUhe 
ability of the mutated gene to be properly transcribed and translated (mcluding s ues of 
-d the ability of the mutant protein product to be appropnately folded 

and/or secreted. 
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A Bacterial Repair and Recombination Systems 

Cells are constantly exposed to nucleic acid-damaging agents, such as UV 
irradiation, oxygen radicals, and alkylation. Further, even the action of DNA 
polymerases is not error-free. Cells must maintain a balance between genetic stability 
(which ensures that genes necessary for vital cellular functions are not damaged dunng 
normal growth and metabolism) and genetic variability (which permits cells to adapt to a 
changing environment). Therefore, there exist separate, but interrelated pathways of 
DNA repair and DNA recombination in most cells. The former serves to stringently 
correct errors in DNA molecules by either directly reversing the damage or excising the 
damaged region and replacing it with the correct sequence. The latter recombination 
system also repairs nucleic acid molecules, but only those lesions that result in damage 
to both strands of DNA such that neither strand is able to serve as a template to correct 
the other. Recombination repair and the SOS response may readily lead to inversions, 
deletions, or other genetic rearrangements within or around the region of the damage, 
which in turn promotes a certain degree of genomic instability which may contribute to 
the ability of the cell to adapt to changing environments or stresses. 

High-fidelity repair mechanisms include direct reversal of DNA damage and 
excision of damage and resynthesis using the information encoded on the opposite DN A 
strand Direct reversal of damage requires an enzyme having an activity opposite of that 
which originally damaged the DNA. For example, inappropriate methylation of DNA 
may be corrected by the action of DNA repair methyltransferases, and nucleotide dimers 
created by UV irradiation may be fixed by the activity of deoxyribodip>-rimidine photo- 
lyase. which, in the presence of light cleaves the dimer back to its constituent 
nucleotides (see Michal, G- (1999) Biochemical Pathways: An Atlas of Biochemistry 
and Molecular Biology, Wiley: New York, and references therein). 

Precise repair of more extensive damage requires specialized repair mechanisms. 
These include the mismatch repair and excision repair systems. Damage to a smgle base 
may be corrected by a series of cleavage reactions, where first the sugar-base bond is 
cut. followed by cleavage of the DNA backbone at the site of damage and removal of 
the damaged base itself. Finally. DNA polymerase and DNA ligase act to fill in and seal 
the gap using the second DNA strand as a template. More significant DNA damage 
which results in altered conformation of the double helix is corrected by the ABC 
system, in which helicase II. DNA polymerase I. UvrA. UvrB. and UvrC proteins 
combine to nick the double helix at the site of damage, to unwind the damaged repon in 
an ATP-dependent fashion, to excise the damaged region, and to fill m the missmg 
region using the other strand as a template. Lastly, DNA ligase seals the mc *" W 
repair systems also exist for G T mismatches (involving the Vsr protem) and for small 
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deleti on/insertion errors resulting in mispairing of the two strands (involving the 
methylation-directcd pathway). 

There also exist low-fidelity repair systems which are generally used to correct 
very extensive DNA damage in bacteria. Double-strand repair and recombination 
occurs in the presence of a lesion which affects both strands of DNA. In this situation, it 
is impossible to repair the damage utilizing the other strand as the template. Thus, this 
repair system involves a double-crossover event between the area of the lesion and 
another copy of the region on a homologous DNA molecule. This is possible because 
bacteria divide so rapidly that a second copy of genomic DNA is usually available 
before actual cell division occurs. This crossover event may readily lead to inversions, 
duplications, deletions, insertions and other genetic rearrangements, and thus increases 
the overall genetic instability of the organism. 

The SOS response is activated when sufficient damage is present in the DNA 
that DNA polymerase III stalls and cannot continue replication. Under these 
circumstances, single-stranded DNA is present. The RecA protein is activated by 
binding to single-stranded DNA. and this activated form results in the activation of the 
LexA repressor, thereby lifting the transcriptional block of more than 20 genes, 
including UvrA. UvrB. UvrC. helicase TI. DNA pol III. UmuC, and UmuD. The 
combined activities of these enzymes results in sufficient filling of the gap region that 
DNA pol III is able to resume replication. However, these gaps have been filled in with 
bases which should not be present; thus, this type of repair results in error-prone repair, 
contributing to overall genetic instability in the cell. 



B. Transposons 

The aforementioned systems, whether high or low fidelity, exist to repair DNA 
damage. In certain circumstances, this repair may accidentally incorporate additional 
genetic rearrangements. Many bacterial cells also have mechanisms specifically 
designed to cause such genetic rearrangements. Particularly well-known examples of 
such mechanisms are the transposons. 

Transposons are genetic elements which are able to move from one site to 
another either within a chromosome or between a piece of extrachromosomal DNA 
(e.g., a plasmid) and a chromosome. Transposition may occur in multiple ways; for 
example, the transposable element may be cut out from the donor site and integrated into 
the target site (nonreplicative transposition), or the transposable element may alternately 
be duplicated from the donor site to the target site, yielding two copies of the element 
(replicative transposition). There is generally no sequence relationship between the 
donor and target sites. 
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There are a variety of results possible from such a ttansposiuon event. The 
integration of a tiansposable element into a gene disrupts the gene, usually abrogating «s 
furlion entirely. An integration even, that occurs in ate DNA sunoundmg a gene may 
no, perturb *e coding sequence i,self, bu, can have a profound effect on the regulation 
of <he gene and thus, on its expression. Recombination even* berween two cop.es of a 
fransposable clement found in different portions of me genome may result^ » . «««. 
duplications, inversions, .repositions, or amplifications of segments of <he genome. 1, 
is also possible for different repliconsto fuse. 

The simplest transposon-Iike genetic elements are termed insertion OS) 
element IS elements contain a nudcotidc region of varying length (though usuaUy less 
than 1 500 bases) lacking any coding regions, surrounded by inverted repeal at emter 
end Thus, since the IS element does no, encode any proteins whose activuy may be 
detected, the presence of an IS element is generally only observed due to a loss of 
function of one or more genes in which the IS element is inserted. 

Transpose are mobile genetic elements which, unlike IS elements, contiun 
nucleic acid sequences bounded b, repeats which may encode one or more protons. It 
"no, unusual for these repeat regions to consis, of IS dements. The prote.ns encoded 
by <he transposon are typically transposes (proteins which catalyze me movement of 
the rransposon ta one site to another) and antibiotic resistance genes. The 
mechaniL and regulation of tiansposable dements are we» known m me an and are 
ravebeendescnbeda.leas,in.fbre^p.e.Lengelere,al.(1999)B,o.o^ 

coli and Sal ~mOa. ASM Press: Washington, D C, Sonenshem. 

(1993). Baa,,^, ASM Press: Washing**,. D.C, v«L D and Voe, «i. 1992) 

Biochemie. VCH: Weinheim. p. 985-990; Brock. T.D.. and Mad.gan. 

Bioiogy of Microorganism,. 6* ed.. Prentice Hall: New York. p *M». .mi Keener. 

I <19W "Regulation of transposition in bacteria". Re*. ^ « ■ »™27. 



) 



5 



^nscriptional apparatits consists of a number of proteins mat can be d.v,ded m«o «wo 
pJ£nerase (the processive DNA-transcribmg enzyme) 

r„am positive bacteria such as Corynebacteria contain only one type of RKA- 

« • varies of different sigma fatfors specific for different promoters. 
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growth phases, environmental conditions, substrates, oxygen levels, transport processes, 
and the like, which permits adaptability of the organism to different environmental and 

metabolic conditions. 

Promoters are specific DNA sequences that serve as docking sites for the RNA 
5 polymerase holoenzyme. Many promoter elements possess conserved sequence 
elements that may be recognized through homology searches: alternately, promoter 
regions for a particular gene may be identified using standard techniques such as primer 
extension. Many promoter regions from gram-positive bacteria are known (see, e.g., 
Sonenshein. AX.. Hoch. J.A.. and Losick, R.. eds. (1 993) Bacillus subtile ASM Press: 

10 Washington, D.C). 

Promoter transcriptional control is influenced by several mechanisms of 
repression or activation. Specific regulatory proteins which bind promoters have the 
ability to block (repressors) or to assist (activators) the binding of the RNA holoenzyme, 
and thus to regulate transcription. The binding of these repressor and activator 

1 5 molecules in turn is regulated by their interactions with other molecules, such as 

proteins or other metabolic compounds. Transcription may alternately be regulated by 
factors influencing processes such as elongation or termination (see. e.g., Sonenshein, 
A.L.. Hoch. J.A.. and Losick, R.. eds. (1993) Bacillus subtilis. ASM Press: Washington, 
D C). The ability to regulate transcription of genes in response to a variety of 

20 environmental or metabolic cues affords cells the ability to tightly control when a gene 
may be expressed and or bow much of a gene product may be present in the cell at one 
time. This in turn prevents unnecessary expenditure of energy or unnecessary utilization 
of possibly scarce intermediate compounds or cofactors. 

25 D Translation and tRNA-Aminoacyl Synthetases 

Translation is the process by which a polypeptide is synthesized from amino 
acids according to the information contained within an mRNA molecule. The main 
components of this process are ribosomes and specific initiation or elongation factors 
such as IF1-3. EF-G, and EFTu (see, e.g., Sonenshein. A.L., Hoch, J..A.. Losick, R, eds. 

30 (1 993) Bacillus subtilis, ASM Press: Washington. D.C. ). 

Each codon of the mRNA molecule encodes a particular amino acid. The 
conversion from mRNA to amino acid is effected by transfer RNA (tRNA) molecule. 
These molecules consist of a single strand of RNA (between 60 and 100 bases), which 
exists in an L-shaped three dimensional structure having protruding areas, or arms . 

35 One such arm forms base pairs with a particular codon sequence on the mRNA 

molecule. Asecondarm i,^^,^^^?'^^^^^ 
encoded by the codon). Other arms of the tRNA include the variable arm. the T V C arm 
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(which bears thimidylate and pseudouridylatc modifications), and the D arm (which 
bears a dihydrouridine modification). The function of these latter structures remains 
unknown, but their conservation between tRNA molecules suggests a role in protein 
synthesis. 

5 In order for the nucleic acid-based tRNA molecule to associate with the correct 

amino acid, a family of enzymes, termed the aminoacyl-tRNA synthetases, must act. 
There exist many different of these enzymes, each of which is specific for a particular 
tRNA and a particular amino acid. These enzymes link the 3* hydroxyl of the terminal 
tRNA adenosine ribose moiety to the amino acid in a two step reaction. First, the 
1 0 enzyme is activated by reaction with ATP and the amino acid to result in an aminoacyl- 
tRNA synthetase-aminoacyl adenylate complex. Second, the aminoacyl group is 
transferred from the enzyme to the target tRNA where it remains in the high-energy 
state. Binding of the tRNA molecule to its cognate codon on the mRNA molecule then 
brings the high-energy amino acid attached to the tRNA into contact with the ribosome. 
15 Within the ribosome. the amino-acid charged tRNA (aminoacyl-tRNA) occupies one 
binding site (the A site) adjacent to a second site (the P site) containing a tRNA 
molecule whose amino acid arm is attached to the nascent polypeptide chain (peptidyl- 
tRNA). The activated amino acid on the aminoacyl-tRNA is sufficiently reactive that a 
peptide bond spontaneously forms between this amino acid and the next amino acid on 
20 the nascent polypeptide chain. Hydrolysis of GTP provides the energy for the transfer of 
the now-polypeptide chain-loaded tRNA from the A site to the P site of the ribosome. 
and the process repeats until a stop codon is reached. 

There are a number of different steps at which translation may be regulated. 
These include the binding of the ribosome to mRNA. the presence of mRNA secondary 
25 structure, codon usage, or the abundance of particular tRNAs. Also, special regulation 
- mechanisms such as attenuation may act at the level of translation. For an uwiepth 
review of many of these mechanisms, see, e.g., Vellanoweth. R.L. (1993) "Translation 
and its Regulation" in: Bacillus subtilis and other Gram Positive Bacteria. Sonenshein, 
A.L. et al., eds., ASM Press: Washington DC, p. 699-71 1. and references cited therein. 

30 

E Protein Folding and Secretion 

Synthesis of proteins by the ribosome results in polypeptide chains, which must 
take on a three-dimensional form before the protein can function normally. This three- 
dimensional structure is achieved by a process of folding. Polypeptide chains are 
35 flexible, and (in principle) move readily and freely in solution until they attain a 
conformation which results in a stable three-dimensional structure. However, rt is 
sometimes difficult for proteins to fold correctly, either due to environmental conditions 
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(e.g., high temperature, where the extra kinetic energy present in the system makes it 
more difficult for the polypeptide to settle in the energy well of a stable structure) or due 
to the nature of the protein itself (e.g., the hydrophobic regions in nearby polypeptides 
have a tendency to aggregate and thereby sequester themselves from aqueous solution). 
5 Proteinaceous factors have been identified that are able to catalyze, chaperone, or 

otherwise assist in the folding of proteins being synthesized either co- or 
posttranslalionally. Members of these protein folding molecules are the prolyl-peptidyl 
isomerases (e.g.. trigger factor, cyclophilin. and FKBP homologs). and proteins of the 
heat shock protein group (e.g., DnaK. DnaJ, GroEL, small heat shock proteins. HtpG 

10 and members of the Clp family (e.g.. ClpA. ClpB. ClpW. ClpP. and ClpX)). Many of 
these proteins are essential for the viability of cells: in addition to their functions in 
protein folding, translocation, and processing, they frequently serve as key targets for 
/ the overall regulation of protein synthesis (see. e.g.. Bukau, B.. (1993) Molecular 

Microbiology 9(4): 671-680; Bukau. B.. and Horwich. A.L. (1998) Cell 92(3):351-366; 

15 Hesterkamp. T.. Bukau, C. (1996) FEBS Lett. 389(l):32-34; Yaron, A.,Naider, F. 

(1993) Critical Reviews in Biochemistry and Molecular Biology 28(1):31-81 ; Scheibel, 
R.. Buchner, J. (1998) Biochemical Pharmacology 56(6):675-682; Ellis. R.J., Haiti, F.U. 
(\996) FASEB Journal 10(1): 20-26: Wawrzynow. A. et al. (1996) Molecular 
Microbiology 21(5): 895-899; Ewalt, K.L.. et al. (1997) Cell 90(3): 491-500). 

20 Chaperones identified thus far function in one of two ways: they either bind and 

stabilize polypeptides, or they provide an environment in which folding may occur 
without interference. The former group, including, e.g., DnaK, DnaJ. and the heat shock 
proteins, bind directly to the nascent or misfoided polypeptide, frequently with 
concomitant ATP hydrolysis. The association of the chaperone prevents the polypeptide 

25 from aggregating with other polypeptides, and can force such aggregates to dissipate if 
they have already formed. After interaction with a second chaperone, GrpE (which 
permits an ADP-ATP exchange to occur), the polypeptide is released in a molten 
globule state and is permitted to fold. If misfolding occurs, the chaperones again 
associate with the misfoided protein, forcing it to return to an unfolded state. This cycle 

30 may be repeated until the protein is correctly folded. Unlike the first type of chaperones, 
which simply bind to the polypeptide, the second group (e.g. GroEL/ES) not only bind 
to the polypeptide, but also completely surround it such that it is protected from the 
surrounding environment. The GroEL/ES complex is composed of 2 stacked 14- 
member rings having a hydrophobic interior surface, and a 7-membered ring 4 cap\ The 

35 polypeptide is drawn into the channel in the center of this complex in an ATP-dependent 
reaction where it is able to fold without interference from other polypeptides. 
Incorrectly folded proteins are not released from the complex. 
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An important step in protein folding is the creation of disulfide bonds. These 
bonds either within a subunit or between subunits of a protein, are critical for protein 
stability. Disulfide bonds form readily in aqueous solution, and incorrect disulfide bond 
formation is difficult to reverse without the aid of a reducing environment. To assist in 
5 this process of correct disulfide bond formation, thiol -containing molecules, such as 
glutathione or thioredoxin, and their respective oxidation/reduction systems arc found in 
the cytosol of most cells (Loferer, H., Hennecke. H. ( 1 994) Trends in Biochemical 

Sciences 19(4): 169-171). 

There are times, however, when folding of nascent polypeptide chams is not 
10 desirable, such as when these polypeptides are to be secreted. The folding process 
generally results in the hydrophobic regions of the protein being in the center of the 
protein, away from aqueous solution, and the hydrophilic regions being presented at the 
outer surfaces of the protein. This conformational arrangement while creating greater 
stability for the protein, makes it difficult for the protein to be translocated across 
1 5 membranes, since the hydrophobic core of the membrane is inherently incompat.ble 
with the hydrophilic exterior of the protein. Thus, proteins synthesized by the cell 
which must be secreted to the exterior of the cell (e.g.. cell surface enzymes and 
membrane receptors) or which must be inserted into the membrane itself (e.g., 
transporter proteins and channel proteins) are generally secreted or inserted prior to 
20 foldinc The same chaperones which prevent aggregation of nascent polypeptide chains 
also prevent folding of polypeptides until they are disengaged. Thus, these proteins may 
'escort" nascent polypeptide chains to an appropriate cellular location where they either 
are removed, thereby permitting folding, or they transfer the polypeptide to a transport 
system which will either secrete the polypeptide or aid its insertion into a membrane. 
25 A specialized protein machinery has evolved that specifically detects, binds, 

transports, and processes proteins bearing specific prosequences (these sequences are 
later removed from the protein by cleavage). This machinery consists of a number of 
proteins which are collectively termed the sec (type II secretion) system (for review, see 
Gilbert, M ct al. (1995) Critical Review in Biotechnology 15(1): 13-39 and references 
30 therein: Freudl. R. (1992) Journal of Biotechnology 23(3): 231-240 and references 
therein; Neidhardt. F.C. ct al. (1996) E. coli and Salmonella ASM Press: 
D C p 967-978; BineL R. et al. (1997) Gene 192(1): 7-11: and Rapoport, T.A. (1986) 
Critical Review in Biochemistry 20(1): 73-1 37, and references therein). The sec system 
is composed of chaperones (e.g., SecA and SecB). integral membrane proteins, also 
35 called translocases (e.g.. SecY, SecE, and SecG). and signal peptidases LepB). 
The nascent polypeptide having a prosequence directing secretion is bound by SecB 
which delivers it to SecA at the inner surface of the cell membrane. Sec A binds to the 
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prosequence and. upon ATP hydrolysis, inserts into the membrane and forces a portion 
of the polypeptide through the membrane as well. The remainder of the polypeptide is 
guided through the membrane by a complex of translocases, such as SecY. SecE. and 
SecG. Finally, the signal peptidase cleaves off the prosequence and the polypeptide is 
free on the extracellular side of the membrane, where it spontaneously folds. 

Sec-independent secretion mechanisms are also known. For example, the signal 
recognition particle-dependent pathway involves the binding of a signal recognition 
particle (SRP) protein to the nascent polypeptide as it is being synthesized, forcing the 
ribosome to stall. A receptor for SRP at the inner surface of the membrane then binds 
the ribosomc-polypeptide-SRP complex. Hydrolysis of GTP provides the energy 
necessary to transfer the complex to the sec translocase complex, where the nascent 
polypeptide is guided across the membrane as it is synthesized by the ribosome. Other 
secretion mechanisms specific to only a few proteins are also known to exist. 



III. Elements and Methods of the Invention 

The present invention is based, at least in part, on the discovery of novel 
molecules, referred to herein as SES nucleic acid and protein molecules, which 
participate in C. glutamicum DNA repair or recombination, in the transposition or other 
rearrangement of C glutamicum DNA, in C glutamicum gene expression (e.g.. the 
processes of transcription or translation), or in protein folding or protein secretion from 
this microorganism. In one embodiment the SES molecules participate in the repair or 
recombination of DNA. in the transposition of genetic material, in gene expression (i.e.. 
the processes of transcription or translation), in protein folding, or in protein secretion in 
Corynebacterium glutamicum. Tn a preferred embodiment the activity of the SES 
molecules of the present invention with regard to DNA repair or recombination, 
transposition of DNA. gene expression, protein folding or protein secretion has an 
impact on the production of a desired fine chemical by this organism. In a particularly 
preferred embodiment the SES molecules of the invention are modulated in activity, 
such that the C. glutamicum cellular processes in which the SES molecules participate 
(e.g.. DNA repair or recombination, transposition of DNA. gene expression, protein 
folding, or protein secretion) are also altered in activity, resulting either directly or 
indirectly in a modulation of the yield, production, and/or efficiency of production of a 
desired fine chemical by C. glutamicum. 

The language, "SES protein" or "SES polypeptide'" includes proteins which 
participate in a number of cellular processes related to C. glutamicum genetic stability, 
gene expression, protein folding, or protein secretion. For example, an SES protein may 
be involved in C. glutamicum DNA repair or recombination mechanisms, in 
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rearrangements of C. glutamicum genetic material (such as those mediated by 
transposons), in transcription or translation of genes in this microorganism, in the 
mediation of C. glutamicum protein folding (such as the activity of chaperones) or m 
secretion of proteins from C glutamicum cells (e.g., the sec system). Examples of SES 
5 proteins include those encoded by the SES genes set forth in Table 1 and Appendix A. 
The terms "SES gene" or "SES nucleic acid sequence' include nucleic acid sequences 
encoding an SES protein, which consist of a coding region and also corresponding 
untranslated 5' and 3' sequence regions. Examples of SES genes include those set forth 
in Table 1 . The terms "production" or "productivity" arc art-recognized and include the 
10 concentration of the fermentation product (for example, the desired fine chemical) 

formed within a given time and a given fermentation volume (e.g., kg product per hour 
per liter). The term "efficiency of production" includes the time required for a particular 
level of production to be achieved (for example, how long it takes for the cell to attain a 
particular rate of output of a fine chemical). The term "yield" or "product/carbon yield" 
15 is art-recognized and includes the efficiency of the conversion of the carbon source into 
the product (i.e.. fine chemical). This is generally written as. for example, kg product 
per kg carbon source. By increasing the yield or production of the compound, the 
quantity of recovered molecules, or of useful recovered molecules of that compound in a 
given amount of culture over a given amount of time is increased. The terms 
"biosynthesis" or a "biosymhetic pathway" are art-recognized and include the synthesis 
of a compound, preferably an organic compound, by a cell from intermediate 
compounds in what may be a multistep and highly regulated process. The terms 
"degradation" or a "degradation pathway" are art-recognized and include the 
breakdown of a compound, preferably an organic compound, by a cell to degradation 
products (generally speaking, smaller or less complex molecules) in what may be a 
multistep and highly regulated process. The language "metabolism" is art-recogn.zed 
and includes the totality of the biochemical reactions that take place in an organism. 
The metabolism of a particular compound, then, (e.g.. the metabolism of an ammo acid 
such as glycine) comprises the overall biosynthetic, modification, and degradation 
pathways in the cell related to this compound. The term "DNA repair" is art-recogmzed 
and includes cellular mechanisms whereby errors in DNA (due either to damage, such 
as but not limited to, ultraviolet radiation, methylases. low-fidelity replication, or 
mutagens) are excised and corrected. The term "recombination" or "DNA 
recombination" is art-recognized and includes cellular mechanisms whereby extensive 
DNA damage affecting both strands of a DNA molecule is corrected by homologous 
recombination with another, undamaged copy of the DNA molecule witon the same 
^ Such repairs are generally low-fidelity, and may result in genetac rearrangements. 
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The term "transposon" is art-recognized and includes a DNA . k- ^ » *£» 
insert randomly throughout the genome of an organism, and wh,ch may result m the 
disruption of genes or their regulatory regions, or in duplications, mverstons. deletions, 
and other genetic rearrangements. The term "protein folding" is art-recogruzed and 
includes the movement of a polypeptide chain through multiple three-toenstonal 
configurations until the stable, active. three-dirnenaonal configuration ,s attiuned. The 
formation of disulfide bonds and the sequestration of hydrophobic regions from the 
surrounding aqueous solution provide some of the driving forces for ttas fo.dmg 
process, and correct folding may be enhanced by the activity of chaperones. The terms 
"secretion" or "protein secretion" is art-recognized and includes the movement of 
proteins from the interior of the cell to the exterior of the cell, in amechamsm whereby 
a system of secretion proteins permits their transit across the cellular membrane to the 

exterior of the cell. 

In another embodiment, the SES molecules of the invention are capable of 
modulating the production of a desired molecule, such as a fine chermcal. m a 
microorganism such as C. There are a number of mechamsms by wh.ch 

the alteration of an SES protein of the invention may directly affect the y,eld. 
production, and/or efficiency of production of a fine chemical i from a C g l~n, 
strain incorporating such an altered protein. For example, modulation of prote,ns 
involved directly in transcription or translation (e.g.. polymerases or nbosomes) such 
that they are increased in number or in activity should increase global cellular 
Ascription or translation ,or rates of these processes). This increased <«™ 
expressL should inc.ude those proteins involved in fine chenucal b,osy«hes, . so an 
increase in yield, production, or efficiency of production of one or more desrred 
Z£T«y occur. Modifications to the transcriptional/transUtiona. protetn 
IXcTof C -ch -hat *e regulation of these proteins .s altered may 

mdirec, Ration of production of one or more fine chemicals For example^ 
^inTthe number or activity of a DNA repair or recombination protem of the 
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invention. one may increase the ability of the cell to detect and repair DNA damage. 
This should effectively increase the ability of the cell to maintain a mutated gene within 
its genome, thereby increasing the likelihood that a transgene engineered into C 
glutamicum (e.g.- encoding a protein which will increase biosynthesis of a fine 
chemical) will not be lost during culture of the microorganism. Conversely, by 
decreasing the number or activity of one or more DNA repair or recombination proteins, 
it may be possible to increase the genetic instability of the organism. Such 
manipulations should improve the ability of the organism to be modified by mutagenesis 
without the introduced mutation being corrected. The same holds true for proteins 
involved in transposition or rearrangement of genetic elements in C glutamicum (e.g.. 
transposons). By mutagenizing these proteins such that they are either increased or 
decreased in number or activity, it is possible to simultaneously increase or decrease the 
genetic stability of the microorganism. This has a profound impact on the ability of any 
other mutation to be introduced into C. glutamicum. and on the ability of introduced 
5 mutations to be retained. Transposons also offer a convenient mechanism by which 
mutagenesis of C glutamicum may be performed; duplication of desired genes (e.g.. 
fine chemical biosynthetic genes) is readily accomplished by transposon mutagenesis, as 
is disruption of undesired genes (e.g.. genes encoding proteins involved in degradation 

of desired fine chemicals). 

10 By modulating one or more proteins (e.g. sigma factors) involved in the 

regulation of transcription or translation in response to particular environmental 
conditions, it may be possible to prevent the cell from slowing or stopping protein 
synthesis under unfavorable environmental conditions, such as those found in large- 
scale fermentor culture. This should lead to increased gene expression, which jn turn 

25 may permit increased biosynthesis of desired fine chemicals under such conditions. 
Many such secreted proteins have functions critical for cell viability (e.g., cell surface 
proteases or receptors). An alteration of a secretory pathway such that these proteins are 
more readily transported to their extracellular location may improve the overall [viability 
ofthe cell, and thus result in greater numbers of C glutamicum cells capable of 

30 producing fine chemicals during large-scale culture. Further, since certain bactenal 

protein secretion pathways (e.g., the sec system) axe known to participate « the insertion 
of integral membrane proteins (such as receptors, channels, pores, or transporters) into 
the membrane, the modulation of activity of proteins involved in protein secretion from 
C. glutamicum may affect the ability ofthe cell to excrete waste products or to import 

35 necessary metabolites. If the activity of these secretory proteins is increased, then the 
ability ofthe cell to produce fine chemicals may be similarly increased (due to ; an 
increase in the presence of transporters/channels in the membrane which may import 
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nutrients or excrete waste products). If the activity of these proteins is decreased, then 
there may be insufficient nutrients available to support overproduction of desired 
compounds, or waste products may interfere with fine chemical biosynthesis. 

The isolated nucleic acid sequences of the invention are contained within the 
5 genome of a Corynebacterium glutamics strain available through the Amencan Type 
Culture Collection, given designation ATCC 13032. The nucleotide sequence of the 
isolated C glutamicum SES cDNAs and the predicted amino acid sequences of the C. 
glutamicum SES proteins are shown in Appendices A and B. respectively. 
Computational analyses were performed which classified and/or identified these 
1 0 nucleotide sequences as sequences which encode proteins involved in the repair or 

recombination of DNA, in the transposition of genetic material, in gene express, on (i.e.. 
the processes of transcription or translation), in protein folding, or in protem secretion in 
Corynebacterium glutamicum. 

The present invention also pertains to F °teins which have an amino acid 
15 sequence which is substantially homologous to an amino acid sequence of Appendix B. 
As used herein, a protein which has an amino acid sequence which is substantially 
homologous to a selected amino acid sequence is least about 50% homologous to the 
selected amino acid sequence, e.g., the entire selected amino acid sequence. A protem 
which has an amino acid sequence which is substantially homologous to 
20 amino acid sequence can also be least about 50-60o/o, preferably at least about 60-70 * 
and more preferably at least about 70-80%, 80-90%. or 90-95%, and most preferably at 
least about 96%, 97%, 98%. 99% or more homologous to the selected amino acid 

sequence^ ^ ^ or a biologically active portion or fragment thereof of the 
25 invention can participate in the repai, or recombination of DNA, in the * ■ 

genetic material, in gene expression (i.e., the processes of transcnption or trans toon) « 
^otein folding, or in protein secretion in Corynebacterium glutam.cum. or have one or 
more of the activities set forth in Table 1 . 

Various aspects of the invention are described in ftrta detail in the foiling 

30 subsections: 

A Isolated Nucleic Acid Molecules 

One aspect of the invention pertains to isolated nudeic acd mo ecules that 
w y *u^««f xx/f>\\ a* nucleic 
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One aspect oi tne lnvwiuua , . . A 

encode SES p^pepudes ot bio.ogiea.lv active portions 

,«,„i;firation of SES-encoding nucleic acid (e.g.. 5t5» ur* a;. >» 
Sl^Sr is intLed ,0 indude DNA mo.eo.les (e.g.. cDNA « genomtc 
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DNA) and RNA molecules (e.g., mRNA) and analogs of *= DNA or RNA generated 
using nucleotide analogs. This .enn also encompasses untranslated sequtt.ee located a, 
both .he 3- and 5' ends of the coding region of the gene: at leas. about 100 nuclides 
of sequence upaream from the 5' end of .he coding region and a. leas, abou, 20 
5 nucleotides of sequence downstream from me 3'end of.be coding regron of the gene. 
Te nucleic acid molecule can be singlcstianded or doub.e-s*aoded but preferably s 
double-stranded DNA. An "isolated" nucleic acid molecule is one wtuch 
ft™ outer nucleic acid molecules which are present in the natural source of *e nude,c 
acid. Preferably, an "isolated" nucleic acid is free of sequences wb.ch natural^ flank 
, 0 me nucleic acid (i.e.. sequences located a, the 5' and T ends of the nude. ac.«,n *e 
genomic DNA of the organism from which «,e nucleic ac,d .s denved. Fo example, m 
various embodiment, the isolated SES nucleic acid molecule can contam less Aan 
about 5 kb. 4kb. 3kb. 2kb. 1 kb. 0.5 kb or 0.1 kb of nucleotide sequences winch 
naturally flank me nucleic acid molecule in genomic DNA of me cell from winch me 
„ nuc.eicacidisderi,e4(e. g .aC^— mcell). Moreover, an "rsolated nuclercacd 
molecule, such as a cDNA molecule, can be substantially free of Cher 
or culture medium when produced by recombinant teckmques. or chenucal precursors or 
other chemicals when chemically synthesized. .„„,,, 
A nucleic acid molecule of the present invention, e.g.. a nucle,c ac.d molecule 
20 having a nucleotide sequence of Appendix A. or a portion thereof can be «^<«« 
standL molecula, biology techniques and the sequence information prodded ta«L 
For example, a C ****** SES cDN A can be isolated from a C. g h,^m horary 

standard hybridation techniques (e.g.. as described ,n Sambrook. J.. ^ 

Ts, k »re la n, Cold Spring Harbor Laboratory Press. Cold Spnng Harbor. NY. 1 989). 
M^C aSl ^d molecule encompassing aU or a port-on of one of the sequences 
oTAPpIdix A can be isolated by ft, polymerase chain reaction using ohgonucleoude 
nrimers^estoed based upon mis sequence (e.g.. a nucleic acid molecule encompassmg 
30 of L sequent of Appendix A can be isolated by me polymerase 

30 Lection using Oligonucleotide primers design* based ^ ~ ^e g 
Appendix A). For example. mRNA can be isolate* from <^.*£?£? h *+' 
byThe guatudiniutn-miocyana.e extraction procedure of Ourgwm e. al. (V 979) 

»• i m 1 8- 5294-5299) and cDN A can be prepared usmg reverse transcriptase 

Biochemistry J 5. Dzy^oz^*; ^ x r MD' 

35 (eg Moloney MLV reverse transcriptase. avaUable from <*^f ^f^^ 

or AMV reverse transeripn.se. available from Seikagaku Amenca. Inc.. * M"*"* 
or AM* reverse u<uu v ^i^~ a <; e chain reaction amplification can 

FL). Synthetic oligonucleotide pnmers for polymerase chain 
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be designed Used upon one of the nucleotide sequences ,W in Appendix A. A 
It^id of the invention c» be amplified using cDNA or, "^^T^X 
DNA, a template and appropriate oligonucleotide primers accordmg to standard PCR 
amplificauon techniques. The nudeic acid so amplified car. be 
appropriate vector and characterized by DNA sequence analysts. Furthermore 

, a „.^ »n CFS nucleotide sequence can be prepared by 

oligonucleotides corresponding to an SE5. nuciconue » h 

staLard synthetic techniques, e.g.. using an automated DNA synthesizer 

InTpreferred embodiment. » iso.ated nucleic acid molecule of the inven ion 
comprises one of me nuc.eotide sequences shown in Appendix £ T« sequences of 
Appendix A correspond to the C«5«*«**« **— «- * ES «W As of the 

,„„ This cDNA comprises sequences encoding SES proteins (i.e.. the coding 
: ind^ ^, hZuence to Appendix A,, as wel, as S untranslated sequences 
S 3 unu^ sequences, also indicated in Appendix A. Alternatively. *e nucleic 
Sl nrS can comprise only the coding region of any of the sequences ,n Appendix 

A ' For me purposes of this application, it -HI be understood that each of the 
fences set forth in Appendix A has an identifying KK A number having the 
dXa. " «£- » * <W* 0*. ^AOOOOS,. Each of ^ 

crises up to three paxts: a 5" upstream region, a coding region, and a do— n 
regir ich of these *ree regions is identified by the same KXA des.gnat.on ^ 
elLinate confusion. The recitation "one of the sequences » Append ^*^» 
to any of the sequences in Appendix A. which may be distinguished by <™ 
^designations. The coding region of each of 
corresponding amino acid sequence, which is set form .n Appendix B. ™* J^™ 
5 A^dix B I identified by the same KXA designations as »* 
c^dUy—d. 

designated RXA00005 is a translation of the cooing region oi 
nucleic acid molecule RXA00005 in he invCTtioI1 not 

,„ one embodiment the mic eic : ac„i ""^J^, ^ genc . . mfm . 

fnr this eene was publ shed m ^ehrmann, et ai. u*™ J * 
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coding region. nlir i-ic acid molecule of the 

In another preferred embodiment, an isolated nucleic acid m 

mention comprises a nucleic acid molecule which is a complement of one 
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nuclcotidc sequences shown in Appendix A. or a portion thereof. A nucleic acid 
molecule which is complementary to one of the nucleotide sequences shown in 
Appendix A is one which is sufficiently complementary to one of the nucleotide 
sequences shown in Appendix A such that it can hybridize to one of the nucleotide 
sequences shown in Appendix A. thereby forming a stable duplex. 

In still another preferred embodiment, an isolated nucleic acid molecule of the 
invention comprises a nucleotide sequence which is at least about 50-60%. preferably at 
least about 60-70%, more preferably at least about 70-80%, 80-90%, or 90-95%, and 
even more preferably at least about 95%, 96%, 97%, 98%, 99% or more homologous to 
a nucleotide sequence shown in Appendix A, or a portion thereof. In an additional 
preferred embodiment, an isolated nucleic acid molecule of the invention comprises a 
nucleotide sequence which hybridizes, e.g.. hybridizes under stringent conditions, to one 
of the nucleotide sequences shown in Appendix A, or a portion thereof. 

Moreover, the nucleic acid molecule of the invention can comprise only a 
portion of the coding region of one of the sequences in Appendix A. for example a 
fragment which can be used as a probe or primer or a fragment encoding a biologically 
active portion of an SES protein. The nucleotide sequences determined from the cloning 
of the SES genes from C glutamicum allows for the generation of probes and primers 
designed for use in identifying and/or cloning SES homologues in other cell types and 
organisms, as well as SES homologues from other Corynebacteria or related species. 
The probe/primer typically comprises substantially purified oligonucleotide. The 
oligonucleotide typically comprises a region of nucleotide sequence that hybridizes 
under stringent conditions to at least about 12. preferably about 25. more preferably 
about 40, 50 or 75 consecutive nucleotides of a sense strand of one of the sequences set 
forth in Appendix A. an anti-sense sequence of one of the sequences set forth in 
Appendix A. or naturally occurring mutants thereof. Primers based on a nucleotide 
sequence of Appendix A can be used in PCR reactions to clone SES homologues. 
Probes based on the SES nucleotide sequences can be used to detect transcripts or 
genomic sequences encoding the same or homologous proteins. In preferred 
embodiments, the probe further comprises a label group attached thereto, e.g. the label 
group can be a radioisotope, a fluorescent compound, an enzyme, or an enzyme co- 
factor. Such probes can be used as a part of a diagnostic test kit for identifying cells 
which miscxpress an SES protein, such as by measuring a level of an SES-encoding 
nucleic acid in a sample of cells, e.g.. detecting SES mRNA levels or determining 
whether a genomic SES gene has been mutated or deleted. 

In one embodiment the nucleic acid molecule of the invention encodes a protein 
or portion thereof which includes an amino acid sequence which is sufficiently 
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homologous to an amino acid sequence of Appendix B such that the protein or portion 
thereof maintains the ability to participate in the repair or recombination of DNA, in the 
transposition of genetic material, in gene expression (i.e., the processes of transcription 
or translation - ), in protein folding, or in protein secretion in Corynebacterium 
glutamicum. As used herein, the language "sufficiently homologous" refers to proteins 
or portions thereof which have amino acid sequences which include a minimum number 
of identical or equivalent (e.g.. an amino acid residue which has a similar side chain as 
an amino acid residue in one of the sequences of Appendix B) amino acid residues to an 
amino acid sequence of Appendix B such that the protein or portion thereof is able to 
participate in the repair or recombination of DNA, in the transposition of genetic 
material, in gene expression (i.e.. the processes of transcription or translation), in protein 
folding, or in protein secretion in Corynebacterium glutamicum. Proteins involved in C 
glutamicum genetic stability, gene expression, protein folding or protein secretion, as 
described herein, may play a role in the production and secretion of one or more fine 
chemicals. Examples of such activities are also described herein. Thus, "the function of 
an SES protein" contributes either directly or indirectly to the yield, production, and/or 
efficiency of production of one or more fine chemicals. Examples of SES protein 
activities are set forth in Table 1 . 

In another embodiment, the protein is at least about 50-60%. preferably at least 
about 60-70%, and more preferably at least about 70-80%, 80-90%. 90-95%. and most 
preferably at least about 96%. 97%, 98%. 99% or more homologous to an entire amino 
acid sequence of Appendix B. 

Portions of proteins encoded by the SES nucleic acid molecules of the invention 
are preferably biologically active portions of one of the SES proteins. As used herein, 
the term "biologically active portion of an SES protein" is intended to include a portion, 
e.g., a domain/motif, of an SES protein that participate in the repair or recombination of 
DNA. in the transposition of genetic material, in gene expression (i.e.. the processes of 
transcription or translation), in protein folding, or in protein secretion in 
Corynebacterium glutamicum. or has an activity as set forth in Table 1. To determine 
whether an SES protein or a biologically active portion thereof can participate in the 
repair or recombination of DNA. in the transposition of genetic material, in gene 
expression (i.e.. the processes of transcription or translation), in protein folding, or in 
protein secretion in Corynebacterium glutamicum. an assay of enzymatic activity may 
be performed. Such assay methods are well known to those skilled in the art. as detailed 
in Example 8 of the Exemplification. 

Additional nucleic acid fragments encoding biologically active portions of an 
SES protein can be prepared by isolating a portion of one of the sequences in Appendix 
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• - „- ^*rr — rr: — 

nurleotidc seaucnces shown m Appenaix m. inuuv - _ 

amino acid sequence shown m Append.* B_ ^ which is 

^^r^rC^sL — sequencesshownmAPP^ 
A itwiUbe appreciated * *°*= * lled in the 311 ** DNA 5CqUH1Ce ^ lym0t > h,Sm5 

, SKmottL sTchnaturalvariation^typicaUyrcsui.inl-SV.vananccmA, 
) SES prolan, sucn na nucleotide vanauons and 

nucleotide sequence of the SES gene. Any an ^ 
resulting amino acid polymorphisms in SES that are the result o 
ft* do not alter the functional activity of SES proteins are mtended to be wnthm the 
scope of the invention. ,„ Iamii:lim 
Hucleic acid molecules =™^'^ v ^ on tan * elated based on 
homologues of the C. sta— SK *^* c id disclo5e d herein using the C. 

JO another embodiment, an isolated nucletc acid molecule ot tn 

nucleotides in -ength and hybridizes under -^H^ the 
rnoleculecom^anucleotidesequenceofApoentoAIno 

. » - . ,„„ , 0 <n 100 250 ot more nucleotides in lcngro- 
nucletc acid >s at least 30. 50, 10U. ^ describc conditions for 

the term "hybridizes under smngen. condmons « homologous 
35 hybridisation and washing under which ™^2 ™lL ** editions are 
to each other typically remain hybridized to each * f^*. 
such that sequences at least about 65%. more preferably at leas, 
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more preferably at least about 75% or more homologous to each other typically remain 
hybridized to each other. Such stringent conditions arc known to those skilled in the art 
and can be found in Current Protocols in Molecular Biology. John Wiley & Sons. N.Y. 
(1989), 6.3.1-6.3.6. A preferred, non-limiting example of stringent hybridization 

5 conditions are hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45 °C. 
followed by one or more washes in 0.2 X SSC, 0.1% SDS at 50-65°C Preferably, an 
isolated nucleic acid molecule of the invention that hybridizes under stringent conditions 
to a sequence of Appendix A corresponds to a naturally-occurring nucleic acid 
molecule. As used herein, a "naturally-occurring" nucleic acid molecule refers to an 

1 0 RNA or DNA molecule having a nucleotide sequence that occurs in nature (e.g.. 

encodes a natural protein). In one embodiment the nucleic acid encodes a natural C. 
glutamicum SES protein. 

In addition to naturally-occurring variants of the SES sequence that may exist in 
the population, the skilled artisan will further appreciate that changes can be introduced 

1 5 by mutation into a nucleotide sequence of Appendix A. thereby leading to changes in the 
amino acid sequence of the encoded SES protein, without altering the functional ability 
of the SES protein. For example, nucleotide substitutions leading to amino acid 
substitutions at "non-essential" amino acid residues can be made in a sequence of 
Appendix A. A "non-essential" amino acid residue is a residue that can be altered from 

20 the wild-type sequence of one of the SES proteins (Appendix B) without altering the 
activity of said SES protein, whereas an "essential" amino acid residue is required for 
SES protein activity. Other amino acid residues, however, (e.g., those that are not 
conserved or only semi-conserved in the domain having SES activity) may not be 
essential for activity and thus are likely to be amenable to alteration without altering 

25 SES activity. 

Accordingly, another aspect of the invention pertains to nucleic acid molecules 
encoding SES proteins that contain changes in amino acid residues that are not essential 
for SES activity. Such SES proteins differ in amino acid sequence from a sequence 
contained in Appendix B yet retain at least one of the SES activities described herein. In 

30 one embodiment, the isolated nucleic acid molecule comprises a nucleotide sequence 
encoding a protein, wherein the protein comprises an amino acid sequence at least about 
50% homologous to an amino acid sequence of Appendix B and is capable of 
participating in the repair or recombination of DNA. in the transposition of genetic 
material, in gene expression (i.e., the processes of transcription or translation), in protein 

35 folding, or in protein secretion in Corynehacierium glutamicum. or has one or more 
activities set forth in Table 1 . Preferably, the protein encoded by the nucleic acid 
molecule is at least about 50-60% homologous to one of the sequences in Appendix B. 
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morc preferably at least about 60-70% homologous to one of the sequences in Appendix 
B. even more preferably at least about 70-80%, 80-90%, 90-95% homologous to one of 
the sequences in Appendix B, and most preferably at least about 96%. 97%. 98%, or 
99% homologous to one of the sequences in Appendix B. 
5 To determine the percent homology of two amino acid sequences (e.g., one of 

the sequences of Appendix B and a mutant form thereof) or of two nucleic acids, the 
sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in 
the sequence of one protein or nucleic acid for optimal alignment with the other protein 
or nucleic acid). The amino acid residues or nucleotides at corresponding amino acid 

1 0 positions or nucleotide positions are then compared. When a position in one sequence 
(e.g.. one of the sequences of Appendix B) is occupied by the same amino acid residue 
or nucleotide as the corresponding position in the other sequence (e.g.. a mutant form of 
the sequence selected from Appendix B). then the molecules arc homologous at that 
position (i.e., as used herein amino acid or nucleic acid "homology" is equivalent to 

15 amino acid or nucleic acid "identity"). The percent homology between the two 

sequences is a function of the number of identical positions shared by the sequences 
(i.e., % homology = # of identical positions/total # of positions x 100). 

An isolated nucleic acid molecule encoding an SES protein homologous to a 
protein sequence of Appendix B can be created by introducing one or more nucleotide 

20 substitutions, additions or deletions into a nucleotide sequence of Appendix A such that 
one or more amino acid substitutions, additions or deletions are introduced into the 
encoded protein. Mutations can be introduced into one of the sequences of Appendix A 
by standard techniques, such as site-directed mutagenesis and PCR-mediated 
mutagenesis. Preferably, conservative amino acid substitutions are made at one or more 

25 predicted non-essential amino acid residues. A "conservative amino acid substitution" is 
one in which the amino acid residue is replaced with an amino acid residue having a 
similar side chain. Families of amino acid residues having similar side chains have been 
defined in the art. These families include amino acids with basic side chains (e.g.. 
lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic acid), 

30 uncharged polar side chains (e.g., glycine, asparagine. glutamine, serine, threonine, 
tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, 
proline, phenylalanine, methionine, tryptophan), beta-branched side chains (e.g., 
threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, 
tryptophan, histidine). Thus, a predicted nonessential amino acid residue in an SES 

35 protein is preferably replaced with another amino acid residue from the same side chain 
family. Alternatively, in another embodiment, mutations can be introduced randomly 
along all or part of an SES coding sequence, such as by saturation mutagenesis, and the 
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resultant mutants can be screened for an SES activity described herein to identify 
mutants that retain SES activity. Following mutagenesis of one of the sequences of 
Appendix A, the encoded protein can be expressed recombinantly and the activity of the 
protein can be determined using, for example, assays described herein (see Example 8 of 
5 the Exemplification). 

In addition to the nucleic acid molecules encoding SES proteins described above, 
another aspect of the invention pertains to isolated nucleic acid molecules which are 
antisense thereto. An "antisense" nucleic acid comprises a nucleotide sequence which is 
complementary to a "sense" nucleic acid encoding a protein, e.g., complementary to the 

10 coding strand of a double-stranded cDNA molecule or complementary to an mRNA 
sequence. Accordingly, an antisense nucleic acid can hydrogen bond to a sense nucleic 
acid. The antisense nucleic acid can be complementary to an entire SES coding strand, 
or to only a portion thereof. In one embodiment an antisense nucleic acid molecule is 
antisense to a " coding region* of the coding strand of a nucleotide sequence encoding an 

1 5 SES protein. The term "coding region" refers to the region of the nucleotide sequence 
comprising codons which are translated into amino acid residues (e.g.. the entire coding 
region of SEQ ID RXA00005 comprises nucleotides 1 to 1608). In another 
embodiment, the antisense nucleic acid molecule is antisense to a "noncoding region" of 
the coding strand of a nucleotide sequence encoding SES. The term "noncoding region" 

20 refers to 5' and 3' sequences which flank the coding region that are not translated into 
amino acids (i.e., also referred to as 5' and 3' untranslated regions). 

Given the coding strand sequences encoding SES disclosed herein (e.g., the 
sequences set forth in Appendix A), antisense nucleic acids of the invention can be 
designed according to the rules of Watson and Crick base pairing. The antisense nucleic 

25 acid molecule can be complementary to the entire coding region of SES mRNA, but 
more preferably is an oligonucleotide which is antisense to only a portion of the coding 
or noncoding region of SES mRNA. For example, the antisense oligonucleotide can be 
complementary to the region surrounding the translation start site of SES mRNA. An 
antisense oligonucleotide can be, for example, about 5, 10. 15, 20, 25, 30, 35, 40, 45 or 

30 50 nucleotides in length. An antisense nucleic acid of the invention can be constructed 
using chemical synthesis and enzymatic ligation reactions using procedures known in 
the art. For example, an antisense nucleic acid (e.g.. an antisense oligonucleotide) can 
be chemically synthesized using naturally occurring nucleotides or variously modified 
nucleotides designed to increase the biological stability of the molecules or to increase 

35 the physical stability of the duplex formed between the antisense and sense nucleic 
acids, e.g.. phosphorothioate derivatives and acridine substituted nucleotides can be 
used. Examples of modified nucleotides which can be used to generate the antisense 
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nucleic acid include 5-fluorouraciK 5-bromouraciI, 5-chlorouracil. 5-iodouracil, 
hypoxanthine, xanthine, 4-acctylcytosine, 5-(carboxyhydroxylmethyl) uracil* 5- 
carboxymethylaminomethyl-2-thiouridine. 5-carbox>TnethylaminomethyJuraciL 
dihydrouracil. beta-D-galactosylqueosine, inosine. N6-isopentenyladenine. 1- 
5 methylguanine, 1-methylinosine, 2,2-dimethylguanine. 2-methyladenine, 2- 

methylguanine. 3-methylcytosine, 5-methylcytosine, N6-adcnine. 7-methylguanine, 5- 
methylaminomcthyluracil, 5-methoxyaminomethyl-2-thiouraciL beta-D- 
mannosylqueosine, 5'-methoxycarboxymethyluracil, 5 -methoxy uracil, 2-methylthio-N6- 
isopentcnyladenine, uracil-5-oxyacctic acid (v). wybutoxosine, pseudouracil. queosine, 

10 2-thiocytosine. 5-methyl-2-thiouracil, 2-thiouraciL 4-thiouracil. 5-methyluraci), uracil-5- 
oxyacetic acid methylester, uracil-5-oxyacetic acid (v). 5-methyl-2-thiouracil, 3-(3- 
amino-3-N-2-carboxypropyl) uracil, (acp3)w. and 2,6-diaminopurine. Alternatively, the 
antisense nucleic acid can be produced biologically using an expression vector into 
which a nucleic acid has been subcloned in an antisense orientation (i.e.. RNA 

1 5 transcribed from the inserted nucleic acid will be of an antisense orientation to a target 
nucleic acid of interest, described further in the following subsection). 

The antisense nucleic acid molecules of the invention are typically administered 
to a cell or generated in situ such that they hybridize with or bind to cellular mRNA 
and/or genomic DNA encoding an SES protein to thereby inhibit expression of the 

20 protein, e.g., by inhibiting transcription and/or translation. The hybridization can be by 
conventional nucleotide complementarity to form a stable duplex, or, for example, in the 
case of an antisense nucleic acid molecule which binds to DNA duplexes, through 
specific interactions in the major groove of the double helix. The antisense molecule can 
be modified such that it specifically binds to a receptor or an antigen expressed on a 

25 selected cell surface, e.g.. by linking the antisense nucleic acid molecule to a peptide or 
an antibody which binds to a cell surface receptor or antigen. The antisense nucleic acid 
molecule can also be delivered to cells using the vectors described herein. To achieve 
sufficient intracellular concentrations of the antisense molecules, vector constructs in 
which the antisense nucleic acid molecule is placed under the control of a strong 

30 prokaryotic, viral, or eukaryotic promoter are preferred. 

Tn yet another embodiment, the antisense nucleic acid molecule of the invention 
is an a-anomeric nucleic acid molecule. An ct-anomeric nucleic acid molecule forms 
specific double-stranded hybrids with complementary RNA in which, contrary to the 
usual J3-units, the strands run parallel to each other (Gaultier et al. (1987) Nucleic Acids 

35 Res. 1 5:6625-6641). The antisense nucleic acid molecule can also comprise a 2*-o- 
methylribonucleotide (Inoue et al. ( 1 987) Nucleic Acids Res. 1 5 :6 1 3 1 -6 1 48) or a 
chimeric RNA-DNA analogue (Inoue et al. (1987) FEBS Lett. 215:327-330). 
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In still another embodiment, an antisense nucleic acid of the invention is a 
ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity which are 
capable of cleaving a single-stranded nucleic acid, such as an mRNA. to which they 
have a complementary region. Thus, ribozymes (e.g.. hammerhead ribozymes 

5 (described in Haselhoff and Gerlach (1988) Nature 334:585-591)) can be used to 

catalytically cleave SES mRNA transcripts to thereby inhibit translation of SES mRNA. 
A ribozyme having specificity for an SES-encoding nucleic acid can be designed based 
upon the nucleotide sequence of an SES cDNA disclosed herein (i.e.. RXA00O05 in 
Appendix A). For example, a derivative of a Telrahymena L-19 TVS RNA can be 

10 constructed in which the nucleotide sequence of the active site is complementary to the 
nucleotide sequence to be cleaved in an SES-encoding mRNA. See. e.g., Cech et al. 
U.S. Patent No. 4.987,071 and Cech et al. U.S. Patent No. 5.116,742. Alternatively, 
SES mRNA can be used to select a catalytic RNA having a specific ribonuclease 
activity from a pool of RNA molecules. See. e.g., Bartel, D. and Szostak. J.W. (1 993) 

15 Science 261:141 1-1 418. 

Alternatively, SES gene expression can be inhibited by targeting nucleotide 
sequences complementary to the regulatory region of an SES nucleotide sequence (e.g.. 
an SES promoter and/or enhancers) to form triple helical structures that prevent 
transcription of an SES gene in target cells. See generally. Helene. C. (1 991) Anticancer 

20 Dmg Des. 6(6):569-84; Helene. C. et al. (1992) Ann. N. Y. Acad. Sci. 660:27-36; and 
Maher. LJ. (1992) Bioassays 14(12):807-15. 

B Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression 

25 vectors, containing a nucleic acid encoding an SES protein (or a portion thereof). As 
used herein, the term "vector" refers to a nucleic acid molecule capable of transporting 
another nucleic acid to which it has been linked. One type of vector is a "plasmid". 
which refers to a circular double stranded DNA loop into which additional DNA 
segments can be ligated. Another type of vector is a viral vector, wherein additional 

30 DNA segments can be ligated into the viral genome. Certain vectors are capable of 
autonomous replication in a host cell into which they are introduced (e.g.. bacterial 
vectors having a bacterial origin of replication and episomal mammalian vectors). Other 
vectors (e.g., non-episomal mammalian vectors) are integrated into the genome of a host 
cell upon introduction into the host cell, and thereby are replicated along with the host 

3 5 genome. Moreover, certain vectors are capable of directing the expression of genes to 
which they are operatively linked. Such vectors are referred to herein as "expression 
vectors". In general, expression vectors of utility in recombinant DNA techniques are 
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often in the form of piasmids. In the present specification, "plasmid" and "vector" can 
be used interchangeably as the plasmid is the most commonly used form of vector. 
However, the invention is intended to include such other forms of expression vectors, 
such as viral vectors (e.g., replication defective retroviruses, adenoviruses and adeno- 
5 associated viruses), which serve equivalent functions. 

The recombinant expression vectors of the invention comprise a nucleic acid of 
the invention in a form suitable for expression of the nucleic acid in a host cell, which 
means that the recombinant expression vectors include one or more regulatory 
sequences, selected on the basis of the host cells to be used for expression, which are 

1 0 operatively linked to the nucleic acid sequence to be expressed. Within a recombinant 
expression vector, "operably linked" is intended to mean that the nucleotide sequence of 
interest is linked to the regulatory sequence(s) in a manner which allows for expression 
of the nucleotide sequence (e.g.. in an in vitro transcription/translation system or in a 
host cell when the vector is introduced into the host celt). The term "regulatory 

1 5 sequence" is intended to include promoters, enhancers and other expression control 
elements (e.g., polyadenylation signals). Such regulatory sequences are described, for 
example, in Goeddel; Gene Expression Technology- Methods in Enzymology 1 85, 
Academic Press. San Diego. CA (1990). Regulatory sequences include those which 
direct constitutive expression of a nucleotide sequence in many types of host cell and 

20 those which direct expression of the nucleotide sequence only in certain host cells. It 
will be appreciated by those skilled in the art that the design of the expression vector can 
depend on such factors as the choice of the host eel] to be transformed, the level of 
expression of protein desired, etc. The expression vectors of the invention can be 
introduced into host cells to thereby produce proteins or peptides, including fusion 

25 proteins or peptides, encoded by nucleic acids as described herein (e.g., SES proteins, 
mutant forms of SES proteins, fusion proteins, etc.). 

The recombinant expression vectors of the invention can be designed for 
expression of SES proteins in prokaryotic or eukaryotic cells. For example, SES genes 
can be expressed in bacterial cells such as C glulamicum, insect cells (using baculovirus 

30 expression vectors), yeast and other fungal cells (see Romanos, M.A. et al. (1 992) 
"Foreign gene expression in yeast: a review". Yeast 8: 423-488; van den Hondcl. 
CA.MJ.J. et al. (1991) "Heterologous gene expression in filamentous fungi" in: More 
Gene Manipulations in Fungi, J.W. Bennet & LX. Lasure, eds.. p. 396-428: Academic 
Press: San Diego; and van den Hondel, C. A.M. J. J. & Punt. PJ. (1991) "Gene transfer 

35 systems and vector development for filamentous fungi, in: Applied Molecular Genetics 
of Fungi. Peberdy. J.F. et al., eds., p. 1-28, Cambridge University Press: Cambridge), 
algae and multicellular plant cells (see Schmidt R. and Willmitzer. L- (J 988) High 



A 
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efficiency Agrobacterium tumefaciens —mediated transformation of Arohidopsis 
thaliana leaf and cotyledon explants" Plant Cell Rep.: 583-586). or mammalian cells. 
Suitable host cells are discussed further in GoeddeL Gene Expression Technology: 
Methods in Enzymology 185. Academic Press. San Diego, CA (1990). Alternatively, the 
5 recombinant expression vector can be transcribed and translated in vitro, for example 
using T7 promoter regulatory sequences and T7 polymerase. 

Expression of proteins in prokaryotes is most often carried out with vectors 
containing constitutive or inducible promoters directing the expression of either fusion 
or non-fusion proteins. Fusion vectors add a number of amino acids to a protein 

10 encoded therein, usually to the amino terminus of the recombinant protein but also to the 
C-terminus or fused within suitable regions in the proteins. Such fusion vectors 
typically serve three purposes: 1) to increase expression of recombinant protein; 2) to 
increase the solubility of the recombinant protein; and 3) to aid in the purification of the 
recombinant protein by acting as a ligand in affinity purification. Often, in fusion 

1 5 expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion 
moiety and the recombinant protein to enable separation of the recombinant protein 
from the fusion moiety subsequent to purification of the fusion protein. Such enzymes, 
and their cognate recognition sequences, include Factor Xa. thrombin and entcrokinase. 
Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith. 

20 D.B. and Johnson, K.S. (1988) Gene 67:31-40). pMAL (New England Biolabs, Beverly, 
MA) and pRIT5 (Pharmacia. Piscataway, NJ) which fuse glutathione S-transferasc 
(GST), maltose E binding protein, or protein A. respectively, to the target recombinant 
protein. In one embodiment, the coding sequence of the SES protein is cloned into a 
pGEX expression vector to create a vector encoding a fusion protein comprising, from 

25 the N-terminus to the C-terminus, GST-thrombin cleavage site-X protein. The fusion 
protein can be purified by affinity chromatography using glutathione-agarose resin. 
Recombinant SES protein unfused to GST can be recovered by cleavage of the fusion 
protein with thrombin. 

Examples of suitable inducible non-fusion £. coli expression vectors include 

30 pTrc (Amann et al.. (1988) Gene 69:301-315) and pET 1 Id (Studier et al.. Gene 
Expression Technology: Methods in Enzymology 185. Academic Press, San Diego. 
California (1 990) 60-89). Target gene expression from the pTrc vector relies on host 
RNA polymerase transcription from a hybrid trp-lac fusion promoter. Target gene 
expression from the pET 1 Id vector relies on transcription from a T7 gnJO-lac fusion 

35 promoter mediated by a coexpressed viral RNA polymerase (T7 gnl). This viral 

polymerase is supplied by host strains BL21(DE3) or HMS174(DE3) from a resident k 
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prophage harboring a T7gnl gene under the transcriptional control of the lacUV 5 
promoter. 

One strategy to maximize recombinant protein expression is to express the 
protein in a host bacteria *ith an impaired capacity to proteolytically cleave the 
5 recombinant protein (Gottesman, S.. Gene Expression Technology: Mvhods m 

Enzymology 185, Academic Press, San Diego. California (1990) 119-128). Another 
strategy is to alter the nucleic acid sequence of the nucleic acid to be inserted into an 
expression vector so that the individual codons for each amino acid are those 
preferentially utilized in the bacterium chosen for expression, such as C. gluiamtcum 
0 \^ex ^992) Nucleic Acids Res. 20:2111-2118). Such alteration of nucleic ac.d 
sequences of the invention can be carried out by standard DNA synthesis techniques. 

In another embodiment, the SES protein expression vector is a yeast expression 
vector. Examples of vectors for expression in yeast 5. cenvisae include pYepSecl 
(Baldari. et al.. (1987) Embo J. 6:229-234). pMFa (Kurjan and Herskowitz. ( 982) Cell 
15 30 933-943). P JRY8S (Schultz et al., (1987) Gene 54:1 13-123), and pYES2 (Invitrogen 
Corporation. San Diego, CA). Vectors and methods for the construction of vectors 
appropriate for use in other fungi, such as the filamentous fungi, include those detailed 
in- van den Hondel. C.A.M.J-J- & Punt. P J. (1991) "Gene transfer systems and vector 
development for filamentous fungi, in: Applied Molecular Genetics of Fungi. J.F. 
20 Peberdy, et al.. eds., p. 1-28, Cambridge University Press: Cambridge. 

Alternatively, the SES proteins of the invention can be expressed in insect cells 
using baculovirus expression vectors. Baculovirus vectors available for express- on of 
proteins in cultured insect cells (e.g., Sf 9 cells) include the pAc series (Smith «t d 
(1983) Mol. Cell Biol. 3:2156-2165) and the pVL series (Lucklowand Summers (1989) 

25 Virology 170:31-39). . . 

I„ mother embodiment, the SES proteins of the invention may be expressed m 
unicellular plant cells (such as algae) or in plant cells from higher plants (e.g.. the 
spermatophy.es. such as crop plan.,,. Examples 

those detailed in: Becker. D_ Kemper, E.. Schell, J. and Masterson. R. <1 992) ^e~ 
50 oUnt binary vectors selccuble markers located proximal to the left border . MM 
M M 20: 1195-1197: and Bevan. M.W. ,,984) "Binary Xr— *- ~« fo ' 
plant transformation". Ar«c/./lcirf.ii«. 12:8711-8721. 

In yet another embodiment a nucleic acid of me invenuon ,s expressed u. 
mammalian cells using a mammalian expression vector. Examples of 
35 expression vectors include pCDMS (Seed. B. (1987) M»~ 329:840) and pMT2PC 
SLfman e, al. (1987) EMBO J. 6:187-195). When used in mammalian cell . the 

-to* control functions are often provided by viral regulatory element.. 
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For example, commonly used promoters arc derived from polyoma. Adenovirus 2, 
cytomegalovirus and Simian Virus 40. For other suitable expression systems for both 
prokaryotic and eukaryotic cells see chapters 1 6 and 1 7 of Sambrook, J.. Fritsh. E. F., 
and Maniatis. T. Molecular Cloning: A Laboratory Manual 2nd. ed, Cold Spring 
Harbor Laboratory t Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY. 
1989. 

In another embodiment, the recombinant mammalian expression vector is 
capable of directing expression of the nucleic acid preferentially in a particular cell type 
(e.g., tissue-specific regulatory elements are used to express the nucleic acid). Tissue- 
specific regulatory elements aTe known in the art. Non-limiting examples of suitable 
tissue-specific promoters include the albumin promoter (liver-specific: Pinkert et al. 
(1987) Genes Dev. 1 :268-277), lymphoid-specific promoters (Calame and Eaton (1988) 
Adv. Immunol 43 :235-275). in particular promoters of T cell receptors (Winoto and 
Baltimore (1989) EMBOJ. 8:729-733) and immunoglobulins (Banerji et al. (1983) Cell 
33:729-740; Queen and Baltimore (1983) Cell 33:741-748). neuron-specific promoters 
(e.g., the neurofilament promoter; Byrne and Ruddle (1989) PNAS 86:5473-5477), 
pancreas-specific promoters (Ediund et al. (1 985) Science 230:912-916), and mammary 
gland-specific promoters (e.g., milk whey promoter; U.S. Patent No. 4,873,3 1 6 and 
European Application Publication No. 264,166). Developmentally-rcgulated promoters 
aTe also encompassed, for example the murine hox promoters (Kessel and Gruss (1990) 
Science 249:374-379) and the a-fctoprotein promoter (Campes and Tilghman (1989) 
Genes Dev. 3:537-546). 

The invention further provides a recombinant expression vector comprising a 
DNA molecule of the invention cloned into the expression vector in an antisense 
orientation. That is, the DNA molecule is operatively linked to a regulatory sequence in 
a manner which allows for expression (by transcription of the DNA molecule) of an 
RNA molecule which is antisense to SES mRNA. Regulatory sequences operatively 
linked to a nucleic acid cloned in the antisense orientation can be chosen which direct 
the continuous expression of the antisense RNA molecule in a variety of cell types, for 
instance viral promoters and/or enhancers, or regulatory sequences can be chosen which 
direct constitutive, tissue specific or cell type specific expression of antisense RNA. 
The antisense expression vector can be in the form of a recombinant plasmid, phagemid 
or attenuated virus in which antisense nucleic acids are produced under the control of a 
high efficiency regulatory region, the activity of which can be determined by the cell 
type into which the vector is introduced. For a discussion of the regulation of gene 
expression using antisense genes see Weintraub, H. et aL Antisense RNA as a 
molecular tool for genetic analysis. Reviews - Trends in Genetics \ Vol. 1(1) (1 986). 
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Another aspect of the invention pertains to host cells into which a recombinant 
expression vector of the invention has been introduced. The ternis "host cell" and 
"recombinant host cell" are used interchangeably herein. It is understood that such 
terms refer not only to the particular subject cell but to the progeny or potential progeny 
of such a cell. Because certain modifications may occur in succeeding generations due 
to either mutation or environmental influences, such progeny may not. in fact, be 
identical to the parent cell but are still included within the scope of the term as used 
herein. 

A host cell can be any prokaryotic or eukaryotic cell. For example, an SES 
protein can be expressed in bacterial cells such as C glutamicum, insect cells, yeast or 
mammalian cells (such as Chinese hamster ovary cells (CHO) or COS cells). Other 
suitable host cells are known to those skilled in the art. Microorganisms related to 
Corynehacterium glutamicum which may be conveniently used as host cells for the 
nucleic acid and protein molecules of the invention are set forth in Table 3. 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via 
conventional transformation or transfection techniques. As used herein, the terms 
"transformation" and "transfection", "conjugation" and 'transduction" are intended to 
refer to a variety of art-recognized techniques for introducing foreign nucleic acid (e.g.. 
DNA) into a host cell, including calcium phosphate or calcium chloride co-precipitation, 
DEAE-dextran-mediated transfection, Iipofection, natural competence, chemical- 
mediated transfer, or clectroporation. Suitable methods for transforming or transfecting 
host cells can be found in Sambrook. et al. {Molecular Cloning A Laboratory Manual 
2nd, ed.. Cold Spring Harbor Laboratory. Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY, 1989), and other laboratory manuals. 

For stable transfection of mammalian cells, it is known that, depending upon the 
expression vector and transfection technique used, only a small fraction of cells may 
integrate the foreign DNA into their genome. In order to identify and select these 
integrants, a gene that encodes a selectable marker (e.g., resistance to antibiotics) is 
generally introduced into the host cells along with the gene of interest. Preferred 
selectable markers include those which confer resistance to drugs, such as G41 8, 
hygromycin and methotrexate. Nucleic acid encoding a selectable marker can be 
introduced into a host cell on the same vector as that encoding an SES protein or can be 
introduced on a separate vector. Cells stably transfected with the introduced nucleic 
acid can be identified by, for example, drug selection (e.g., cells that have incorporated 
the selectable marker gene will survive, while the other cells die). 

To create a homologous recombinant microorganism, a vector is prepared which 
contains at least a portion of an SES gene into which a deletion, addition or substitution 
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has been introduced to thereby alter, e.g.. functionally disrupt, the SES gene. 
Preferably, this SES gene is a Corynebacterium glutamicum SES gene, but it can be a 
homologue from a related bacterium or even from a mammalian, yeast, or insect source. 
In a preferred embodiment, the vector is designed such that, upon homologous 
5 recombination, the endogenous SES gene is functionally disrupted (i.e.. no longer 
encodes a functional protein: also referred to as a "knock out" vector). Alternatively, 
the vector can be designed such that, upon homologous recombination, the endogenous 
SES gene is mutated or otherwise altered but still encodes functional protein (c.g. r the 
upstream regulatory region can be altered to thereby alter the expression of the 
1 0 endogenous SES protein). In the homologous recombination vector, the altered portion 
of the SES gene is flanked at its 5' and 3' ends by additional nucleic acid of the SES 
gene to allow for homologous recombination to occur between the exogenous SES gene 
carried by the vector and an endogenous SES gene in a microorganism. The additional 
flanking SES nucleic acid is of sufficient length for successful homologous 
1 5 recombination with the endogenous gene. Typically, several kilobases of flanking DNA 
(both at the 5" and 3' ends) are included in the vector (see e.g.. Thomas. K.R.. and 
Capecchi, M.R. (1 987) Cell 51 : 503 for a description of homologous recombination 
vectors). The vector is introduced into a microorganism (e.g.. by electroporation) and 
cells in which the introduced SES gene has homologously recombined with the 
20 endogenous SES gene are selected, using art-known techniques. 

In another embodiment, recombinant microorganisms can be produced which 
contain selected systems which allow for regulated expression of the introduced gene. 
For example, inclusion of an SES gene on a vector placing it under control of the lac 
operon permits expression of the SES gene only in the presence of IPTG. Such 
25 regulatory systems are well known in the art. 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 
culture, can be used to produce (i.e.. express) an SES protein. Accordingly, the 
invention further provides methods for producing SES proteins using the host cells of 
the invention. In one embodiment, the method comprises culturing the host cell of 
30 invention (into which a recombinant expression vector encoding an SES protein has 

been introduced, or into which genome has been introduced a gene encoding a wild-type 
or altered SES protein) in a suitable medium until SES protein is produced. In another 
embodiment, the method further comprises isolating SES proteins from the medium or 
the host cell. 

35 
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C. Isolated SES Proteins 

Another aspect of the invention pertains to isolated SES proteins, and 
biologically active portions thereof. An "isolated" or "purified" protein or biologically 
active portion thereof is substantially free of cellular material when produced by 
5 recombinant DNA techniques, or chemical precursors or other chemicals when 

chemically synthesized. The language "substantially free of cellular material" includes 
preparations of SES protein in which the protein is separated from cellular components 
of the cells in which it is naturally or recombinantly produced. In one embodiment, the 
language "substantially free of cellular material" includes preparations of SES protein 

10 having less than about 30% (by dry weight) of non-SES protein (also referred to herein 
as a "contaminating protein"), more preferably less than about 20% of non-SES protein, 
still more preferably less than about 10% of non-SES protein, and most preferably less 
than about 5% non-SES protein. When the SES protein or biologically active portion 
thereof is recombinantly produced, it is also preferably substantially free of culture 

1 5 medium, i.e., culture medium represents less than about 20%, more preferably less than 
about 10%. and most preferably less than about 5% of the volume of the protein 
preparation. The language "substantially free of chemical precursors or otheT 
chemicals" includes preparations of SES protein in which the protein is separated from 
chemical precursors or other chemicals which are involved in the synthesis of the 

20 protein. In one embodiment, the language "substantially free of chemical precursors or 
other chemicals" includes preparations of SES protein having less than about 30% (by 
dry weight) of chemical precursors or non-SES chemicals, more preferably less than 
about 20% chemical precursors or non-SES chemicals, still more preferably less than 
about 10% chemical precursors or non-SES chemicals, and most preferably less than 

25 about 5% chemical precursors or non-SES chemicals. In preferred embodiments, 

isolated proteins or biologically active portions thereof lack contaminating proteins from 
the same organism from which the SES protein is derived. Typically, such proteins are 
produced by recombinant expression of. for example, a C glutamicum SES protein in a 
microorganism such as C. glutamicum. 

30 An isolated SES protein or a portion thereof of the invention can participate in 

the repair or recombination of DNA, in the transposition of genetic material, in gene 
expression (i.e.. the processes of transcription or translation), in protein folding, or in 
protein secretion in Coryne bacterium glutamicum, or has one or more of the activities 
set forth in Table 1 . Tn preferred embodiments, the protein or portion thereof comprises 

35 an amino acid sequence which is sufficiently homologous to an amino acid sequence of 
Appendix B such that the protein or portion thereof maintains the ability to participate in 
the repair or recombination of DNA. in the transposition of genetic material, in gene 
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expression (i.e., the processes of transcription or translation"), in protein folding, or in 
protein secretion in Corynebacterium glutamicum. The portion of the protein is 
preferably a biologically active portion as described herein. In another preferred 
embodiment, an SES protein of the invention has an amino acid sequence shown in 
Appendix B. In yet another preferred embodiment, the SES protein has an amino acid 
sequence which is encoded by a nucleotide sequence which hybridizes, e.g., hybridizes 
under stringent conditions, to a nucleotide sequence of Appendix A. In still another 
preferred embodiment the SES protein has an amino acid sequence which is encoded by 
a nucleotide sequence that is at least about 50-60%, preferably at least about 60-70%. 
more preferably at least about 70-80%, 80-90%, 90-95%. and even more preferably at 
least about 96%, 97%, 98%, 99% or more homologous to one of the amino acid 
sequences of Appendix B. The preferred SES proteins of the present invention also 
preferably possess at least one of the SES activities described herein. For example, a 
preferred SES protein of the present invention includes an amino acid sequence encoded 
by a nucleotide sequence which hybridizes, e.g.. hybridizes under stringent conditions, 
to a nucleotide sequence of Appendix A, and which can participate in the repair or 
recombination of DNA, in the transposition of genetic material, in gene expression (i.e.. 
the processes of transcription or translation), in protein folding, or in protein secretion in 
Corynebacterium glutamicum. or which has one or more of the activities set forth in 
Table 1. 

In other embodiments, the SES protein is substantially homologous to an amino 
acid sequence of Appendix B and retains the functional activity of the protein of one of 
the sequences of Appendix B yet differs in amino acid sequence due to natural variation 
or mutagenesis, as described in detail in subsection I above. Accordingly, in another 
embodiment, the SES protein is a protein which comprises an amino acid sequence 
which is at least about 50-60%. preferably at least about 60-70%. and more preferably at 
least about 70-80. 80-90. 90-95%. and most preferably at least about 96%. 97%. 98%. 
99% or more homologous to an entire amino acid sequence of Appendix B and which 
has at least one of the SES activities described herein. In another embodiment, the 
invention pertains to a full length C. glutamicum protein which is substantially 
homologous to an entire amino acid sequence of Appendix B. 

Biologically active portions of an SES protein include peptides comprising 
amino acid sequences derived from the amino acid sequence of an SES protein, e.g.. the 
an amino acid sequence shown in Appendix B or the amino acid sequence of a protein 
homologous to an SES protein, which include fewer amino acids than a full length SES 
protein or the full length protein which is homologous to an SES protein, and exhibit at 
least one activity of an SES protein. Typically, biologically active portions (peptides. 
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e.g., peptides which are, for example, 5, 10. 15. 20. 30, 35. 36. 37, 38. 39. 40. 50. 100 or 
more amino acids in length) comprise a domain or motif with at least one activity of an 
SES protein. Moreover, other biologically active portions, in which other regions of the 
protein are deleted, can be prepared by recombinant techniques and evaluated for one or 
more of the activities described herein. Preferably, the biologically active portions of an 
SES protein include one or more selected domains/motifs or portions thereof having 
biological activity. 

SES proteins are preferably produced by recombinant DN A techniques. For 
example, a nucleic acid molecule encoding the protein is cloned into an expression 
vector (as described above), the expression vector is introduced into a host cell (as 
described above) and the SES protein is expressed in the host cell. The SES protein can 
then be isolated from the cells by an appropriate purification scheme using standard 
protein purification techniques. Alternative to recombinant expression, an SES protein, 
polypeptide, or peptide can be synthesized chemically using standard peptide synthesis 
techniques. Moreover, native SES protein can be isolated from cells (e.g.. endothelial 
cells), for example using an anti-SES antibody, which can be produced by standard 
techniques utilizing an SES protein or fragment thereof of this invention. 

The invention also provides SES chimeric or fusion proteins. As used herein, an 
SES "chimeric protein" or "fusion protein" comprises an SES polypeptide operatively 
linked to a non-SES polypeptide. An "SES polypeptide" refers to a polypeptide having 
an amino acid sequence corresponding to an SES protein, whereas a "non-SES 
polypeptide" refers to a polypeptide having an amino acid sequence corresponding to a 
protein which is not substantially homologous to the SES protein, e.g., a protein which 
is different from the SES protein and which is derived from the same or a different 
organism. Within the fusion protein, the term "operatively linked" is intended to 
indicate that the SES polypeptide and the non-SES polypeptide are fused in-frame to 
each other. The non-SES polypeptide can be fused to the N-terminus or C-termmus of 
the SES polypeptide. For example, in one embodiment the fusion protein is a GST-SES 
fusion protein in which the SES sequences are fused to the C-terminus of the GST 
sequences. Such fusion proteins can facilitate the purification of recombinant SES 
proteins. In another embodiment, the fusion protein is an SES protein containing a 
heterologous signal sequence at its N-terminus. In certain host cells (e.g., mammalian 
host cells), expression and/or secretion of an SES protein can be increased through use 
of a heterologous signal sequence. 

Preferably, an SES chimeric or fusion protein of the invention is produced by 
standard recombinant DMA techniques. For example, DNA fragments coding for the 
different polypeptide sequences are ligated together in-frame in accordance with 
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conventional techniques, for example by employing blunt-ended or stagger-ended 
termini for ligation, restriction enzyme digestion to provide for appropriate termini, 
filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to avoid 
undesirable joining, and enzymatic ligation. In another embodiment, the fusion gene 
can be synthesized by conventional techniques including automated DNA synthesizers. 
Alternatively. PCR amplification of gene fragments can be carried out using anchor 
primers which give rise to complementary overhangs between two consecutive gene 
fragments which can subsequently be annealed and reamplified to generate a chimeric 
gene sequence (see, for example. Current Protocols in Molecular Biology, eds. Ausubel 
0 et al. John Wiley & Sons: 1992). Moreover, many expression vectors are commercially 
available that already encode a fusion moiety (e.g.. a GST polypeptide). An SES- 
encoding nucleic acid can be cloned into such an expression vector such that the fusion 
moiety is linked in-frame to the SES protein. 

Homologues of the SES protein can be generated by mutagenesis, e.g., discrete 
5 point mutation or truncation of the SES protein. As used herein, the term "homologue" 
refers to a variant form of the SES protein which acts as an agonist or antagonist of the 
activity of the SES protein. An agonist of the SES protein can retain substantially the 
same or a subset, of the biological activities of the SES protein. An antagonist of the 
SES protein can inhibit one or more of the activities of the naturally occurring form of 
20 the SES protein, by, for example, competitively binding to a downstream or upstream 
member of a biochemical cascade which includes the SES protein, by binding to a target 
molecule with which the SES protein interacts, such that no function interaction is 
possible, or by binding directly to the SES protein and inhibiting its normal astmty. 

In an alternative embodiment, homologues of the SES protein can be identified 
25 by screening combinatorial libraries of mutants, e.g.. truncation mutants, of the SES 
protein for SES protein agonist or antagonist activity. In one embodiment, a vanegatcd 
library of SES variants is generated by combinatorial mutagenesis at the nucleic acd 
level and is encoded by a variegated gene library. A variegated library of SES variants 
can be produced by, for example, enzymatically ligating a mixture of synthetic 
30 oligonucleotides into gene sequences such that a degenerate set of potential SES 

sequences is expressible as individual polypeptides, or alternatively, as a set of larger 
fusion proteins (e.g., for phage display) containing the set of SES ^ nces ^ 
There are a variety of methods which can be used to produce libraries of potermal SES 
homologues from a degenerate oligonucleotide sequence. Chemical synthesis of a 
3 5 degenerate E ene sequence can be performed in an automatic DNA synthesizer, and the 
symhetic ge~ne then ligated into an appropriate expression vector. Use of a degenerate 
eTf genes allows for the provision, in one mixture, of all of the sequences encoding 
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the desired set of potential SES sequences. Methods for synthesizing degenerate 
oligonucleotides are known in the art (sec. e.g.. Narang. S.A. (1983) Tetrahedron 39:3: 
Itakuractal.(1984M«n". Rev. Biochem 53:323: Itakuraet at. (1984) Science 
198:1 056: Ike et al. (1983) Nucleic Acid Res. 1 1 =477. 

In addition, libraries of fragments of the SES protein coding can be used to 
generate a variegated population of SES fragments for screening and subsequent 
selection of homologues of an SES protein. In one embodiment, a library of coding 
sequence fragments can be generated by treating a double stranded PCR fragment of an 
SES coding sequence with a nuclease under conditions wherein nicking occurs only 
about once per molecule, denaturing the double stranded DNA. renaturing the DNA to 
form double stranded DNA which can include sense/antisense pairs from different 
nicked products, removing single stranded portions from reformed duplexes by 
treatment with SI nuclease, and ligating the resulting fragment library into an expression 
vector. By this method, an expression library can be derived which encodes N-terminal, 
C-terminal and internal fragments of various sires of the SES protein. 

Several techniques are known in the art for screening gene products of 
combinatorial libraries made by point mutations or truncation, and for screening cDNA 
libraries for gene products having a selected property. Such techniques are adaptable for 
rapid screening of the gene libraries generated by the combinatorial mutagenesis of SES 
homologues. The most widely used techniques, which are amenable to high through-put 
analysis, for screening large gene libraries typically include cloning the gene library into 
replicable expression vectors, transforming appropriate cells with the resulting library of 
vectors, and expressing the combinatorial genes under conditions in which detection of a 
desired activity facilitates isolation of the vector encoding the gene whose product was 
detected. Recursive ensemble mutagenesis (REM), a new technique which enhances the 
frequency of functional mutants in the libraries, can be used in combination with the 
screening assays to identify SES homologues (Arkin and Yourvan (1992) PNAS 
59:781 1-7815: Delgrave et al. (1993) Protein Engineering 6(3):327-331). 

In another embodiment, cell based assays can be exploited to analyze a 
variegated SES library, using methods well known in the art. 

D Uses and Methods of the Invention 

The nucleic acid molecules, proteins, protein homologues. fusion proteins, 
primers, vectors, and host cells described herein can be used in one or more of the 
following methods: identification of C. glutamicum and related organisms: mapping of 
genomes of organisms related to C glutamicum; identification and locahzation of C. 
glutamicum sequences of interest: evolutionary studies: determination of SES protein 
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regions required for function: modulation of an SES protein activity; modulation of the 
metabolism of one or more cell membrane components: modulation of the 
transmembrane transport of one or more compounds: and modulation of cellular 
production of a desired compound, such as a Fine chemical. 

The SES nucleic acid molecules of the invention have a variety of uses. First 
they may be used to identify an organism as being Corynebacterium gluiamicum or a 
close relative thereof. Also, they may be used to identify the presence of C glutamicum 
or a relative thereof in a mixed population of microorganisms. The invention provides 
the nucleic acid sequences of a number of C glutamicum genes: by probing the 
extracted genomic DNA of a culture of a unique or mixed population of microorganisms 
under stringent conditions with a probe spanning a region of a C. glutamicum gene 
which is unique to this organism, one can ascertain whether this organism is present. 
Although Corynebacterium glutamicum itself is nonpathogenic, it is related to 
pathogenic species, such as Corynebacterium diphtheriae. Detection of such organisms 
is of significant clinical relevance. 

Further the nucleic acid and protein molecules of the invention may serve as 
markers for specific regions of the genome. This has utility not only in the mapping of 
the genome, but also for functional studies of C glutamicum proteins. For example, to 
identify the ration of the genome to which a particular C. glutamicum DNA-bmdmg 
protein bind^the C glutamicum genome could be digested, and the fragments incubated 
with the DNA-binding protein. Those which bind the protein may be additionally probed 
with the nucleic acid molecules of the invention, preferably with readily detectable 
labels: bindine of such a nucleic acid molecule to the genome fragment enables the 
localization of the fragment to the genome map of C glutamicum. and. when performed 
multiple times with different enzymes, facilitates a rapid determination of the nucleic 
acid sequence to which the protein binds. Further, the nucleic acid molecules of the 
invention may be sufficiently homologous to the sequences of related species such that 
these nucleic acid molecules may serve as markers for the construction of a genomic 
map in related bacteria, such as BrevibacteHum lacto/ermentum. 

The SES nucleic acid molecules of the invention are also useful for evolutionary 
and protein structural studies. The metabolic and transport processes in which the 
m olecules of the invention participate are utilized by a wide variety of prokaryoUc and 
eukaryotic cells; by comparing the sequences of the nucleic acid molecules of the 
present invention to those encoding similar enzymes from other organisms the 

permits an assessment of which regions of the s = re 

not, which may aid in deterrnining those regions of the protein wnicn ar 
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• r*u ~,~me This type of determination is of value for protein 

proteins, -mcsc proteins may be ^proved m cfBetency or «* * * P 
example, by decrcaSmg , of the cell to corre ct genetic errors, which 

of transposons or by increasing the number act.v,tv - WMP ^ 
possible to increase the genetic stability of C z lu,am,cum. wh.ch an . . 

r™^. Such manipulation is possible by placed of one or more 

Modulation of proteins £ productio n of a fine 

^cun, can have bo* ^^^^^J^ > ^ ~»ich 
chemical from these rmcroorgamsms. For exarnp y 1 
dtarfy translates a gene (e.g., a polymerase, ^ of 

» (e.g.. a repressor or acuva^r protein). ,t *e « 

degradation of a fine chemical, this type o g a repressor protein such 

effect on the production of this fine chemtcaL ^ such 

that it can no longer repress its target gene, or mutagenesis °« J 
35 that i, is optimized in aenvity should lead to an = nc *en - 

gene. If the targe, gene is. for example, a fine chemical ^ g nmnber rf 
increase in production of that chemical may result, due to the over gr 
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transcripts present for the gene, which should result in greater numbers of the protein as 
well. Increasing the number or activity of a repressor protein for a target sequence or 
decreasing the number or activity of an activator protein for a target sequence when this 
sequence is. for example, a fine chemical degradative protein, then a similar increase in 
production of the fine chemical should result. 

Indirect effects on Fine chemical production may also arise due to manipulation of 
proteins involved in transcription and translation. For example, by modulating the 
activity or number of transcription factors (e.g.. the sigma factors) or translational 
repressors/activators which globally regulate transcription in C glutamicum in response 
to environmental or metabolic factors, it should be possible to uncouple cellular 
transcription from environmental or metabolic regulation. In turn, this may permit 
continued transcription under conditions which would normally slow or altogether stop 
gene expression, such as those unfavorable conditions (e.g.. high temperature, low 
oxygen, high waste product levels) which exist in large-scale fermentor cultures. By 
increasing the rate of gene (e.g.. fine chemical biosynthetic gene) expression in such 
situations, the overall rate of fine product production may also be increased, at least due 
to the relatively greater number of fine chemical biosynthetic proteins in the cell. 
Principles and examples for modification of transcription and transcriptional regulation 
are described in. e.g.. Lewin. B. (1 990) Genes IV, Part 3: "Controlling procaryotic genes 
by transcription" Oxford Univ. Press: Oxford, p. 213-301. 

Modulation of the activity or number of proteins involved in polypeptide folding 
(e.g., chaperones) may permit an increase in the overall production of correctly folded 
molecules in the cell. This has two effects: first, an overall increase in the number of 
proteins in the cell, due to the fact that fewer proteins are misfolded and degraded, and 
second, an increase in the number of any given protein that is correctly folded and thus 
active (see. e.g.. Thomas. J.G.. Baneyx, F. (1997) Protein Expression W Purification 
1 1(3): 289-296; Luo. Z-H-. and Hua, Z.C. (1998) Biochemistry and Molecular Biology 
International 16(3): 471-477; Dale. G.E.. et al. (1994) Protein Engineering 7(7): 925- 
931 ; Amrcin, K.E. etal. (1995> Proc. Natl Acad. Set U.S.A. 92(4): 1048-1052: and 
Caspers, P. et al. (1994) Cell. Mol. Biol. 40(5): 635-644). While such mutations result 
in an increase in the number of active proteins of all kinds, when coupled with 
additional mutations increasing the activity or number of, e.g.. a fine chemical 
biosynthetic protein, an additive effect in the amount of correctly folded, active desired 
protein may be obtained. 

Manipulation of proteins involved in secretion of polypeptides from C. 
glutamicum such that they are improved in activity or number may directly improve the 
secretion of a proteinaceous fine chemical (e.g.. an enzyme) from this microorganism. Tt 
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is significantly easier to harvest and purify fine chemicals when they are secreted into 
the medium of Jarge-scaJe cultures than when they arc retained in the cell, so the yield 
and production of a fine chemical should be increased through such secretion system 
engineering. Genetic manipulation of these secretion proteins may also Tesult in indirect 
improvements in the production of one or more fine chemicals. First, increased or 
decreased activity of one or more C glutamicum secretion systems (as brought about by 
mutagenesis of one or more SES proteins involved in such pathways) may result in 
increased or decreased global secretion rates from the cell. Many such secreted proteins 
have functions critical for cell viability (e.g.. cell surface proteases or receptors). An 
alteration of a secretory pathway such that these proteins are more readily transported to 
their extracellular location may improve the overall viability of the cell, and thus result 
in greater numbers of C glutamicum cells capable of producing fine chemicals during 
large-scale culture. Second, certain bacterial secretion systems, (e.g., the sec system) 
are known to play a significant role in the process by which integral membrane proteins 
(e.g. channels, pores, or transporters) insert into cellular membranes. If the activity of 
one or more secretory pathway proteins is increased, then the ability of the cell to 
produce fine chemicals may be similarly increased, due to the presence of increased 
intracellular nutrient levels or decreased intracellular waste levels. If the activity of one 
or more such secretory pathway protein is decreased, then there may be insufficient 
nutrients available to support overproduction of desired compounds, or waste products 
may interfere with the biosynthesis of desired fine chemicals. 

The aforementioned mutagenesis strategies for SES proteins to result in 
increased yields of a fine chemical from C giulamicum are not meant to be limiting: 
variations on these strategies will be readily apparent to one skilled in the art. Using 
such strategies, and incorporating the mechanisms disclosed herein, the nucleic acid and 
protein molecules of the invention may be utilized to generate C. glutamicum or related 
strains of bacteria expressing mutated SES nucleic acid and protein molecules such that 
the yield, production, and/or efficiency of production of a desired compound is 
improved. This desired compound may be any product produced by C glutamicum, 
which includes the final products of biosynthesis pathways and intermediates of 
naturally-occurring metabolic pathways, as well as molecules which do not naturally 
occur in the metabolism of C. giulamicum, but which are produced by a C. glutamicum 
strain of the invention. 
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This invention is further illustrated by the following examples which should not 
be construed as limiting. The contents of all references, patent applications, patents, and 
published patent applications cited throughout this application are hereby incorporated 
by reference. 

5 

Exemplification 

Example 1 : Preparation of total genomic DNA of Corynebacterium glutamicum 
ATCC 13032 

1 0 A culture of Corynebacterium glutamicum (ATCC 1 3032) was grown overnight 

at 30°C with vigorous shaking in BHI medium (Difco). The cells were harvested by 
centrifugation. the supernatant was discarded and the cells were resuspended in 5 ml 
buffer-I (5% of the original volume of the culture — all indicated volumes have been 
calculated for 100 ml of culture volume). Composition of buffcr-I: 140.34 g/1 sucrose. 

15 2.46 g/1 MgSO. x 7H 2 0, 10 ml/1 KH,PO. solution (100 g/L adjusted to pH 6.7 with 
KOH). 50 ml/1 Ml 2 concentrate (10 g/1 (NRi) 2 SO-, 1 g/1 Nad, 2 g/1 MgSO, x 7H,0, 
0.2 g/1 CaCl 2 . 0.5 g/1 yeast extract (Difco). 10 ml/1 trace-elements-mix (200 mgA FeS0 4 
x H 2 0, 1 0 mg/1 ZnSO, x 7 H 3 0. 3 mg/t MnCl 2 x 4 H 3 CX 30 mg/1 H 3 B0 3 20 mg/1 CoCl, x 
6 H,0, 1 mg/1 NiCl 2 x 6 H 2 0, 3 mg/1 Na^oO. x 2 H 2 0, 500 mg/1 complexing agent 

20 (EDTA or critic acid). 100 ml/1 vitamins-mix (0.2 mg/1 biotin. 0.2 mg/1 folic acid. 20 
mg/1 p-amino benzoic acid. 20 mg/1 riboflavin, 40 mg/1 ca-panthothenate. 140 mg/1 
nicotinic acid, 40 mg/1 pyridoxole hydrochloride, 200 mg/1 myo-inositol). Lysozyme 
was added to the suspension to a final concentration of 2.5 mg/ml. After an 
approximately 4 h incubation at 37X. the cell wall was degraded and the resulting 

25 protoplasts are harvested by centrifugation. The pellet was washed once with 5 ml 
buffcr-I and once with 5 ml TE-buffer (10 mM Tris-HCl. 1 mM EDTA. pH 8). The 
pellet was resuspended in 4 ml TE-buffer and 0.5 ml SDS solution (10%) and 0.5 ml 
NaCl solution (5 M) are added. After adding of proteinase K to a final concentration of 
200 ng/ml. the suspension is incubated for ca.18 h at 37°C. The DNA was purified by 

30 extraction with phenol, phenol-chloroform-isoamylalcohol and chloroform- 

isoamylalcohol using standard procedures. Then, the DNA was precipitated by adding 
1/50 volume of 3 M sodium acetate and 2 volumes of ethanol, followed by a 30 min 
incubation at -20°C and a 30 min centrifugation at 12.000 rpm in a high speed centrifuge 
using a SS34 rotor (Sorvall). The DNA was dissolved in 1 ml TE-bufTer containing 20 
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Hg/ml RNaseA and dialysed at &°C against 1000 ml TE-buffer for at least 3 hours. 
During this time, the buffer was exchanged 3 times. To aliquots of OA ml of the 
dialysed DNA solution, 0.4 ml of 2 M LiCl and 0.8 ml of ethanol are added. After a 30 
min incubation at -20°C. the DNA was collected by centrifiigation (13,000 rpm^Biofuge 
5 Fresco. Heraeus* Hanau. Germany). The DNA pellet was dissolved in TE-buffer. DNA 
prepared by this procedure could be used for all purposes, including southern blotting or 
construction of genomic libraries. 

Example 2: Construction of genomic libraries in Escherichia coli of Corynebactcrium 

1 0 gtutamicum ATCC 1 3032. 

Starting from DNA prepared as described in Example 1, cosmid and plasmid 
libraries were constructed according to known and well established methods {see e.g., 
Sambrook, J. et al (1989) ''Molecular Cloning : A Laboratory ManuaT\ Cold Spring 
Harbor Laboratory Press, or AusubeL F.M. et al (1 994) "Current Protocols in Molecular 

1 5 Biology". John Wiley & Sons.) 

Any plasmid or cosmid could be used. Of particular use were the plasmids pBR322 
(Sutcliffe, J.G. (1979) Proc. Natl. Acad. Sci. USA, 75:3737-3741); pACYC177 (Change & 
Cohen (1978) J. Bactcriol 134:1 141-1 156), plasmids of the pBS series (pBSSK+ pBSSK- and 
others; Stratagene. LaJolla, USA), or cosmids as SuperCosl (Stratagene, LaJolla, USA) or 

20 Lorist6 (Gibson. T.J.. Rosenthal A. and Waterson. R.H. (1987) Gene 53:283-286. 

Example 3: DNA Sequencing and Computational Functional Analysis 

Genomic libraries as described in Example 2 were used for DNA sequencing 
according to standard methods, in particular by the chain termination method using 
25 ABI377 sequencing machines (see e.g., Fleischman. RD. ct al. (1 995) "Whole-genome 
Random Sequencing and Assembly of Haemophilus Influenzae Rd., Science, 269:496- 
512). Sequencing primers with the following nucleotide sequences were used: 5 % - 
GGAAAC AGTATGACC ATG-3 ' or 5 4 -GT AAAACGACGGCCAGT-3 * . 

30 Example 4: In vivo Mutagenesis 

In vivo mutagenesis of Corynebactcrium glutamicum can be performed by passage of 
plasmid (or other vector) DNA through E. coli or other microorganisms (e.g. Bacillus spp. or 
yeasts such as Saccharomyces cerevisiae) which are impaired in their capabilities to maintain 
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the intesrity of their genetic informer.. Typical mutator strains have mutations . *c genes 
for .he DNA repair system (e.g., m«HLS. mutD. mutT. etc, for reference, sec Rupp-^D. 
(1996) DNA repair mechanisms, in: EscherM, cofi and Salmonella, p. 2277-2294. ASM: 
Washington.) Such strains are well known to those skilled in the art. The use of such strains » 
, illustrated, for example, in Greener. A. and Callahan. M. (1994) Strategies 7: 32-34. 

Example S: DNA Transfer Between &c*erfc*.«. ctt and Covnetocterium 

glutamicum 

Several Orynebacariw* and Brwbaaerium species contain endogenous 
0 plasmids (as e.g.. pHM1519 or pBLl) which replicate autonomously (for review sec, e.g.. 
Martin. J.F. « aL (1987) Biotechnology. 5:137-146). Shuttle vectors for bMta « • 
and can be readily constructed by using standard vecters for 

£ co/i (Sambrook, J. «' (1989). "Molecular Cloning: A Laboratory Manual , Cold 
Spring Harbor Laboratory Press or Ausubel. F.M. e, al. (1994) "Current Protocols m 
15 Molecular Biology". John Wiley ft Sons) to which a origin or replication for and a 

suitable marker from Corynhocertu* g lu<amicum is added. Such origins of replication 
are preferably taken from endogenous plasmids isolated from Cory»ebac,er,um and 
M— species. Of particular use as transformation markers tor these species arc 
genes for kanamyein resistance (such as those derived from the Tn5 or Tn90 
20 .ransposons) or ch.oramphenico. (Winnacker. EX. (1987) "From O—* 0 ""- 

(ntroduction to Gene Technology. V C H. Weinheim). There are numerous examp.es .ft. 
Uterature of the construction of a wide variety of stamle vectors which repl ca«e « both £ 
co, W C. and which can be used for severe, purposes, ^ over- 

expression (for reference, see e.g, Yoshihama. M. et al. (1985) I. Bactenol. 162J91-597. 
25 Ztin ,F. e, a,. (1987, Biotechnology. 5,37-146 and Eikmanns. B., e, a,. (.991, Gene. 

,W:93 ulg sundard methods, it is possible to clone a gene of interest into one of the 
shuttle vectors described above and to introduce such a hybrid vectors into strains of 
a^— Transformation of C ^^^^ 
30 protoplast transformation (Kastsumata, R. et al. (1984) J. Bacteriol. 159306-.1 1) 
30 protoplast 53:399-303) and m cases 

electroporation(Liebl.E.etal.(i«7iriii , . ^ 

where special vectors are used, also by conjugation (as desenbed e.g. m Schafer. A c, al. 
^UBacUriol. ,72, 663-1666, It is also possible ,0 transfer the shuttle vecto. for 
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C g/utamicum to £ coli by preparing plasmid DNA from C. glutamicum (using standard 
methods well-known in the art) and transforming it into E coli. This transformation step 
can be performed using standard methods, but it is advantageous to use an Mcr-deficient 
E coli strain, such as NM522 (Gough & Murray (1 983) J. Mol. Biol. 166:1 -1 9). 

Example 6: Assessment of the Expression of the Mutant Protein 

Observations of the activity of a mutated protein in a transformed host cell rely on 
the fact that the mutant protein is expressed in a similar fashion and in a similar quantity 
to that of the wild-type protein. A useful method to ascertain the level of transcription of 
the mutant gene (an indicator of the amount of mRNA available for translation to the gene 
product) is to perform a Northern blot (for reference see. for example, Ausubel et al. 
(1988) Current Protocols in Molecular Biology, Wiley: New York), in which a primer 
designed to bind to the gene of interest is labeled with a detectable tag (usually radioactive 
or chemiluminescent), such that when the total RNA of a culture of the organism is 
extracted, run on gel. transferred to a stable matrix and incubated with this probe, the 
binding and quantity of binding of the probe indicates the presence and also the quantity 
of mRNA for this gene. This information is evidence of the degree of transcription of the 
mutant gene. Total cellular RNA can be prepared from Corynebacterium glutamicum by 
several methods, all well-known in the art, such as that described in Bormann, E.R. et al. 
(1992) Afo/. Microbiol. 6: 317-326. 

To assess the presence or relative quantity of protein translated from this mRNA, 
standard techniques, such as a Western blot may be employed (see. for example, Ausubel 
et al. (1988) Current Protocols in Molecular Biology. Wiley: New York). In this process, 
total cellular proteins are extracted, separated by gel electrophoresis, transferred to a 
matrix such as nitrocellulose, and incubated with a probe, such as an antibody, which 
specifically binds to the desired protein. This probe is generally tagged with a 
chemiluminescent or colorimetric label which may be readily detected. The presence and 
quantity of label observed indicates the presence and quantity of the desired mutant 
protein present in the cell. 
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Example 7: Growth of Genetically Modified Corynehacterium glutanticum - Media 

and Culture Conditions 

Genetically modified Corynebacteria are cultured in synthetic or natural growth 
media. A number of different growth media for Corynebacteria are both well-known and 
5 readily available (Lieb et al. (1989) Appl. Microbiol. BiotechnoL 32:205-210: von der 
Osten et al. (1998) Biotechnology Letters. 11:11-16: Patent DE 4,120.867; Liebl (1 992) 
"The Genus Corynehacterium, in: The Procaryotes. Volume II, Balows, A. ct oL eds. 
Springer-Verlag). These media consist of one or more carbon sources, nitrogen sources, 
inorganic salts, vitamins and trace elements. Preferred carbon sources are sugars, such as 
1 0 mono- di-. or polysaccharides. For example, glucose, fructose, mannose. galactose, 
ribose. sorbose, ribulose. lactose, maltose, sucrose, raffinose, starch or cellulose serve as 
very good carbon sources. It is also possible to supply sugar to the media via complex 
compounds such as molasses or other by-products from sugar refinement. It can also be 
advantageous to supply mixtures of different carbon sources. Other possible carbon 
1 5 sources are alcohols and organic acids, such as methanol, ethanol, acetic acid or lactic 
acid. Nitrogen sources are usually organic or inorganic nitrogen compounds, or materials 
which contain these compounds. Exemplary nitrogen sources include ammonia gas or 
ammonia salts, such as NH.C1 or (NR.) 2 SO.. NH.OH, nitrates, urea, amino acids or 
complex nitrogen sources Hke corn steep liquor, soy bean flour, soy bean protein, yeast 

20 extract, meat extract and others. 

Inorganic salt compounds which may be included in the media include the 
chloride-, phosphorous- or sulfate- salts of calcium, magnesium, sodium, cobalt, 
molybdenum, potassium, manganese, zinc, copper and iron. Chelating compounds can be 
added to the medium to keep the metal ions in solution. Particularly useful chelating 
compounds include dihydroxyphenols, like catechol or protocatechuate, or organic acids, 
such as citric acid. It is typical for the media to also contain other growth factors, such as 
vitamins or growth promoters, examples of which include biotin, riboflavin, thiamin, fohc 
acid, nicotinic acid, pantothenate and pyridoxin. Growth factors and salts frequently 
originate from complex media components such as yeast extract, molasses, corn steep 
liquor and others. The exact composition of the media compounds depends strongly on 
the immediate experiment and is individually decided for each specific case. Information 
about media optimization is available in the textbook "Applied Microbiol. Physiology, A 
Practical Approach «* P.M. Rhodes, P.F. Stanbury. TRL Press (1997) pp. 53-73, ISBN 0 
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19 963577 3). It is also possible to select growth media from commercial suppliers, like 
standard 1 (Merck) or BHI (grain heart infusion. DIFC) or others. 

All medium components are sterilized, either by heat (20 minutes at 1 .5 bar and 
121-Q or by sterile filtration. The components can either be sterilized together or, if 
necessary, separately. All media components can be present at the beginning of growth, 
or they can optionally be added continuously or batchwise. 

Culture conditions are defined separately for each experiment. The temperature 
should be in a range between 1 5'C and 45*C. The temperature can be kept constant or can 
be altered during the experiment. The pH of the medium should be in the range of 5 to 
8.5, preferably around 7.0. and can be maintained by the addition of buffers to the media. 
Anexemplary buffer for this purpose is a potassium phosphate buffer. Synthetic buffers 
such as MOPS. HEPES, ACES and others can alternatively or simultaneously be used. It 
is also possible to maintain a constant culture pH through the addition of NaOH or 
NH,OH during growth. If complex medium components such as yeast extract are utilized, 
the necessity for additional buffers may be reduced, due to the fact that many complex 
compounds have high buffer capacities. If a fermentor is utilized for culturing the rmcro- 
organisms. the pH can also be controlled using gaseous ammonia. 

The incubation time is usually in a range from several hours to several days. This 
time is selected in order to permit the maximal amount of product to accumulate in the 
broth. The disclosed growth experiments can be carried out in a variety of vessels, such as 
microliter plates, glass tubes, glass flasks or glass or metal fermentors of different sizes. 
For screening a large number of clones, the microorganisms should be cultured « 
microtiter plates, glass tubes or shake flasks, either with or without baffles. Preferably 
100 ml shake flasks are used, filled with 10% (by volume) of the required growth 
medium The flasks should be shaken on a rotary shaker (amplitude 25 mm) using a 
speed-range of 1 00 - 300 rpm. Evaporation losses can be climinished by the maintenance 
of a humid atmosphere; alternatively, a mathematical correction for evaporation losses 

should be performed. 

If genetically modified clones are tested, an unmodified control clone or a control 
, elo ne containing the basic plasmid -itbou, any insert should also be tested. Th« , medium 
is inoculated to an OD 6M of D.5 - 1 .5 using cells gro»n on agar plates, such as CM plates 
(10 g/1 glucose. 2.5 g/1 Nad. 2 g/i urea. 1 0 g/1 polypeptone. 5 g/1 yeas, extract. 5 g/1 meat 
extract. 22 g/1 Nad. 2 g/! urea, 10 g/1 polypeptone. 5 g/1 yeast extract 5 g/1 meat extract. 
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22 g/1 agar pH 6.8 with 2M NaOH) that had been incubated at 30'C. Inoculation of the 
media is accomplished by either introduction of a saline suspension of C glutamics cells 
from CM plates or addition of a liquid preculture of this bacterium. 

5 Example 8 - In vitro Analysis of the Function of Mutant Proteins 

The determination of activities and kinetic parameters of enzymes is well 
established in the art. Experiments to determine the activity of any given altered 
enzyme must be tailored to the specific activity of the wild-type enzyme, which „ well 
within the ability of one skilled in the art. Overviews about enzymes in general, as well 
10 as specific details concerning structure, kinetics, principles, methods, apphcafcons and 
examples for the determination of many enzyme activities may be found, for example, m 
the following references: Dixon. M.. and Webb, E.G.. (1979) Enzymes. Longmans: 
London; Fersht, (1985) Enzyme Structure and Mechanism. Freeman: New York; 
Walsh (1979) Enzymatic Reaction Mechanisms. Freeman: San Francisco; Price. N.C.. 
15 Stevens. L. (1982) Fundamentals of Enzymology. Oxford Univ. Press: Oxford; Boyer. 
P D , ed. (1983) The Enzymes. 3* ed. Academic Press: New York; Bisswanger. H 
(1994) Enzymkinetik. T A ed. VCH: Weinheim (ISBN 3527300325); Bergmeyer. H.U.. 
Bergmeyer. J.. GraBl, M., eds. (1983-1986) Methods of Enzymatic Analysis 3 ed.. vol. 
I-XII. Verlag Chemie: Weinheim; and Ullmann's Encyclopedia of Industrial Chemistry 
20 (1987) vol. A9. "Enzymes". VCH: Weinheim, p. 352-363. 

The activity of proteins which bind to DNA can be measured by several well- 
established methods, such as DNA band-shift assays (also called gel retardation assays). 
The effect of such proteins on the expression of other -lecules can be ™d usmg 
reporter gene assays (such as that described in Ko.mar, H. et al. (1995) EMBO J. W. 
25 3895-3904 and references cited therein). Reporter gene test systems are well known and 
established for applications in both pro- and eukaryotic cells, using enzymes such as 
beta-galactosidase. green fluorescent protein, and several others. , omeA 
The determination of activity of membrane-transport proteins can be performed 
according to techniques such as those described in Gennis. R.B. (1 989) "Pore^ 
30 Channels and Transporters", in Biomembranes, Molecular Structure and Function, 
Springer: Heidelberg, p. 85-137; 199-234; and 270-322. 

Example 9: Analysis of Impact of Mutant Protein on the Production of the Desired 

Pr ° dUC The effect of the genetic modification in C. glutamic on production of a 

• m\ Kr a« eS ccd by crowing the modified 
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the medium and/or the cellular component for increased production of the des.red 
product (i.e., an amino acid). Such analysis techniques are well known to one skilled in 
the art. and include spectroscopy, thin layer chromatography, staining methods of 
various kinds, enzymatic and microbiological methods, and analytical chromatography 
such as high performance liquid chromatography (see. for example, Ullman. 
Encyclopedia of Industrial Chemistry, vol. A2. p. 89-90 and p. 443-613, VCH: 
Weinheim (1985); Fallon. A. et al., (1987) "Applications of HPLC in Biochemistry ' in: 
Laboratory Techniques in Biochemistry and Molecular Biology, vol. 17; Rehm et al. 
(1993) Biotechnology, vol. 3, Chapter III: "Product recovery and purification page 
469-7H VCH: Weinheim; Belter. P.A. et al. (1988) Bioseparations: downstream 
processing for biotechnology, John Wiley and Sons: Kennedy. J.F. and Cabral. J.M.S. 
(1992) Recovery processes for biological materials, John Wiley and Sons: Sbaeiwitz, 
J A and Henry, J.D. (1988) Biochemical separations, in: Ulmanns Encyclopedia of 
Industrial Chemistry, vol. B3. Chapter 1 1, page 1-27, VCH: Weinheim; and Dechow, 
F.J. (1989) Separation and purification techniques in biotechnology, Noycs 
Publications.) 

In addition to the measurement of the final product of fcrmetitation. it is also 
possible to analyze other components of the metabolic pathways utilized for the 
production of the desired compound, such as intermediates and side-products, to 
determine the overall efficiency of production of the compound. Analysis methods 
include measurements of nutrient levels in the medium (e.g.. sugars, hydrocarbons, 
nitrogen sources, phosphate, and other ions), measurements of biomass composition and 
growth analysis of the production of common metabolites of biosynthetic pathways, and 
measurement of gasses produced during fermentation. Standard methods for these 
measurements are outlined in Applied Microbial Physiology, A Practical Approach. 
P.M. Rhodes and P.F. Stanbury. eds., IRL Press, p. 103-129; 131-163; and 165-192 
(TSBN: 01 99635773) and references cited therein. 

Example 10: Purification of the Desired Product from C. gtutcmicum Culture 
, Recovery of the desired product from the C. ghuamtcw* cells or supernatant of 

ft. above-described culture can be performed by various methods well knovm in the art. 
If the desired product is not secreted from the cells, the cells can be harvested from the 
culture by low-speed centrifugation. the cells can be iysed by standard techniques, such 
asTcchsLcal fL or sonicauon. The cellular debris is removed by centnfcgauon. and 
S the supernatant fraction containing the soluble proteins is retained for further 

pupation of the desired compound. If the product is secreted from the C. */_ 
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ccUs. then the cells are removed from the culture by low-speed centrifugation, and the 
supemate fraction is retained for further purification. 

The supernatant fraction from either purification method is subjected to 
chromatography with a suitable resin, in which the desired molecule is either retained on 
a chromatography resin while many of the impurities in the sample are not, or where the 
impurities are retained by the resin while the sample is not. Such chromatography steps 
may be repeated as necessary, using the same or different chromatography resins. One 
skilled in the art would be well-versed in the selection of appropriate chromatography 
resins and in their most efficacious application for a particular molecule to be purified. 
The purified product may be concentrated by filtration or ultrafiltration, and stored at a 
temperature at which the stability of the product is maximized. 

There are a wide array of purification methods known to the art and the 
preceding method of purification is not meant to be limiting. Such purification 
techniques are described, for example, in Bailey, J.E. & Ollis. D.F. Biochemical 
Engineering Fundamentals. McGraw-Hill: New York (1986). 

The identity and purity of the isolated compounds may be assessed by techniques 
standard in the art. These include high-performance liquid chromatography (HPLC). 
spectroscopic methods, staining methods, thin layer chromatography, NTRS. enzymatic 
assay, or microbiologically. Such analysis methods are reviewed in: Patek et al. (1 994) 
Appl Environ. Microbiol. 60: 133-140: Malakhova et al. (1996) Biotekhnologiya 1 1 : 27- 
32- and Schmidt et al. (1998) Bioprocess Engineer. 19: 67-70. Ulmann s Encyclopedia 
of Industrial Chemistry. (1996) vol. A27 T VCH: Weinheim, p. 89-90, p. 521-540. p. 540- 
547. p. 559-566, 575-581 and p. 581-587; Michal. G. (1999) Biochemical Pathways: An 
Atlas of Biochemistry and Molecular Biology, John Wiley and Sons; Fallon, A. et al. 
(1987) Applications of HPLC in Biochemistry in: Laboratory Techniques in 
Biochemistry and Molecular Biology, vol. 17. 

Equivalents 

Those skilled in the art will recognize, or will be able to ascertain using no more 
than routine experimentation, many equivalents to the specific embodiments of the 
invention described herein. Such equivalents are intended to be encompassed by the 
following claims. 
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BASF Aktiengesellscliaft 990674 O.Z. 0050/50181 

Appendix A & B 

»RXA02436~amino acid sequence 
(1-561 f translated) 187 residues 

WGISLDWM MGVMTSKTAT AILHTNRGDI TIDLFGNHAP ETVANFVGLA QGTKDYQSAN AQGDSEGPFY 
NGSVFHRVID GFMIQGEDPT GTGRGGPGYT FADE FHPELR FDRAYLLAMA NAGPGTNGSQ FFI TVTPTPH 
LNNAHTIFGE VTDAESQKW DAI AT TATDR YDRPADAWI ESVEITA 
>RXA02436-nucleotide sequence A: upstream 

ATCCACTACTAACTTGAGGTTGCCAATCGTGCTCATATTCAACTCTCCTTAAATTGGTCGCGTTCTTCTAAGACCCC 
ACAT T AGTAAAAGCATGGGACAT 

>RXA02436-nucleotide sequence B: coding region 

GTGGTTGGCATTTCGtCTCGATGTAGTTATGATGGGCGTTATGACTTCTAAGACCGCAACTGCGATTCTGCATACAAA 

TCGCGGAGACATCACCATCGACCTGTTCGGCAACCACGCTCCAGAGACCGTCGCTAACTTCGTTGGCCTGGCACAGG 

GCACCAAGGACTACCAGTCCGCAAACGCTCAGGGCGACAGCGAAGGTCCGTTCTACAACGGATCTGTCTTCCACCGC 

GTCATCGACGGTTTCATGATCCAGGGTGAAGACCCAACCGGCACCGGCCGTGGCGGCCCTGGCTACACCTTCGCTGA 

TGAATTCC^CCCAGAGCTGCGCTTCGACCGCGCATACC^ 

CCGAGTTCTTQATCACTGTGACCCCAACCCCTGACC^ 

GAGTCTCAGAAGGTTGTGGATGCAATTGCAACCACCGCAACCGATCGTTACGACCGCCCAGCTGACGCAGTTGTCAT 
CGAGTCTGTAGAGATCACCGCG 

>RXA02436-nucleotide sequence C: downstream 
TAACAGCCACCTCTACGTACACT 
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Appendix A & B 

»RXA02323-amino acid sequence 
(1-924, translated) 308 residues 

MS SHDLVDW WGAGAAGLA AAVALGRSLR SVIVIDAGQP RNSYAHAAHN VLGQE G I APA ELLEKGRAEA 
RSYGVTIAPG RVAKVERTGS TFAITLDDAS LLHSRRI I LA HGAVDDLPEV EGLSDFWGTK VLHCAYCHGF 
EARDSEIVW GTSPMAAHQA IJyIFSQLSKTV SLVGT ID IDE QTSESLDSAG VKVLGTNAVR VSAEGDGLSV 
ELSEGDHIiSC DNIWASRPL VDGTLYTQLG GQMEENPMGR FIPGTQTGRT PIEGVWAAGN AQAPMAMVYG 
SAAQGVMAGA EINFDLILED ISVASAQS 
>RXA02 32 3 -nucleotide sequence A: upstream 

GGAATCATTTTCAATAGAGTCGACGCAAGTGTACACTTCTTAATGGAAATTGTTTTCAATAAAGTCAAGTTTTTTGA 
CCTTCGCTTTTTAGGAGCACCCC 

>RXA02 32 3 -nucleotide sequence B: coding region 

ATGTCATCTCATGATCTCGTTGACGTAGTTGTCGTCGGCGCCGGCGCTGCAGGTCTCGCCGCCGCTGTCGCGCTCGG 
CCGCTCACTGCGCAGTGTCATCGTCATCGACGCTGGTCAACCCCGTAACAGCTATGCGCACGCTGCTCACAATGTCC 
TCGGCCAGGAAGGCATTGCGCCCGCCGAGCTGCTGGAAAAAGGCCGCGCCGAAGCGCGTTCCT^ 

GCGCCCGGGCGCGTAGCAAAAGTTGAGCGCACCGGTTCCACCTTCGCCATAACGCTTGACGACGCCTCCCTCCTTCA 
CTCTCGGCGCATCATTTTGGCCCACGGCGCCGTTGACGATCTGCCAGAGGTAGAAGGACTGTCAGATTTTTGGGGAA 
CCAAAGTGTTGCACTGCGCTTACTGCCACGGCTTTGAGGCCCGCGATTCTGAAATCGTCGTGGTGGGTACCTCGCCC 
ATGGCTGCGCACCAAGCGTTGATGTTCTCGCAGTTGTCCAAAACTGTCAGCTTGGTGGGCACGATCGACATTGATGA 
ACAAACCAGCGAGAGCCTAGATAGTGCTGGAGTAAAAGTGTTGGGCACCAATGCGGTGCGCGTATCCGCCGAAGGTG 
ATGGCCTGTCTGTGGAACTGTCCGtAAGGCGATCATTTAAGCTGCGACAACATCGTGGTGGCATCTCGTCCACTGGTG 
GATGGCACGCTGTACACCCAACTTGGTGGTCAGATGGAAGAAAACCCGATGGGCAGGTTCATTC 

CGGGCGCACTCCTATTGAAGGTGTGTGGGCTGCCGGAAACGCGCAAGCTCCCATGGCGATGGTCTATGGTTCCGCTG 
CTCAAGGCGTGATGGCTGGAGCAGAGATCAACTTTGATCTGATCCTGGAAGATATTTCCGTGGCAAGCGCGCAGAGC 
>RXA023 2 3 -nucleotide sequence C: downstream 
TAAACTGCGTGAGGTTGTGGCCT 



BASF Aktiengesellschaft 990674 O.Z. 0050/50181 

Appendix a & B 

»RXA02047-amino acid sequence 
(1-192, translated) 64 residues 

WDDRVH PAHA RLFAQALLDA GQAVDYYENT EGGHAGAADN KQTAFVESL.I YTWIEKTLDQ QGSI 
>RXA02 04 7 -nucleotide sequence B: coding region 

TGGGACGACCGCGTCCACCCCGCGCACGCGCGCCTTTTTGCTCAAGCTTTGCTTGATGCG6GCCAGGCCGTGGATTA 

CTACGAAAACACCGAGGGCGGCCATGCCGGCGCGGCGGATAACAAGCAGACCGCGTTTGTGGAATCGCTGATCTACA 

CCTGGATCGAAAAGACTTTGGATCAGCAGGGTAGCATT 

>RXA02 047 -nucleotide sequence C: downstream 

TAATACCTAT GAT T ATGCGAAGG 



BASF Aktiengesellschaft 990674 O.Z. 0050/50181 

Appendix A & B 

»RXA01863-amino acid sequence 
(1-807, translated) 269 residues 

MNSNX.TLP QDRYLDIAEY IDVMHISHNW GTTDEFANVG FGAMKKQPPL KAKLKLYEQM ISNARTLSEQ 
GMFVSAETML NQSTLPHLRK I HQEWHDMK CSRHE IHPMY PADFASQLNV LTLAEMKKTI HDILDFRDED 
rWMLFGTLPV FPCLKDDEDQ KLLSRLRNAN NVTTRNDPDG RSRLNVNVFT GNVIVTDFGD ETGTISNIQK 
DKLTDVFDKW LSSDLAKSLN CHCSEFSCLG PNVTjVKNMYY PNMDFKDNER HMHKQPQIIQ F 
>RXA0 18 63 -nucleotide sequence A: upstream 

T T AGAAATGT TGT TAACCTC TAT T AAAGTATGCACATCATCGAGGTAT ATATACACAA 
>RXA0 18 63 -nucleotide sequence B: coding region 

ATGAATTCAAACC T AAGAT TGCC TCAAGATCGT T AT TTAGATAT TGCTGAATAT ATCGATGT TATGCATATC TCACA 
TAACTGGGGAACAACTGATGAATTCGCAAATGTTGGCTTTGGCGCAATGAAGAAGCAACCACCG 

TAAAATTATATGAACAAATGATTTCGAATGCACGTACATTATCAGAACAAGGAATGTTTGTATCTGCGGAAACAATG 

CTCAATCAAAGTACGCTACCACATTTACGAAAAATACATCAAGAAGTCGTTCATGATATGAAATGTAGCAGACACGA 

GAT TCACCCTATGTATCCAGC TGAC T T TGCAAGTCAAT T AAATGTGTTAAC TCTAGCGGAAATGAAAAAGACAAT TC 

ATGATATATTGGATTTCAGAGATGAAGATATTTGGATGTTATTTGGTACTTTGCCTGTGTTTCCATGCTTAAAGGAT 

GATGAAGATCAAAAGTTACTATCACGTTTAAGAAATGCTAACAATGTAACGACTAGAAATGACCCGGATGGCCGTAG 

TCGTTTAAATGTCAATGTATTTACAGGTAATGTAATCGTAACTGATTTCGGAGATGAAACAGGTACAATTTCGAATA 

TACAAAAAGATAAATTAACAGATGTATTTGATAAATGGTTATCCTCTGATCTTGCTAAATCATTAAATTGTCATTGT 

TCCGAGTTTAGTTGTTTAGGACCAAATGTTCTTGTTAAAAATATGTACTATCCGAATATGGATTTTAAAGATAATGA 

GCGTCATATGCACAAACAACCACAAAT T ATACAAT T T 

>RXA0 18 63 -nucleotide sequence C: downstream 

TAAAAAC TC T TAATT ATGCGGAG 



BASF Aktiengesellsctiaft 990674 O.Z. 0050/50181 

Appendix A & B 

»RXA0 184 1 -amino acid sequence 
(1-363, translated) 121 residues 

MEEGEEISLS DFEGEVWLN AWGQWCAPCR AEVDDLQLVQ ETLDPLGGTV LGINVRDYNQ TIAQDFKLDN 

AVTYPSIYDP PFRIAAALGG VPTSVIPTTI VLDRSHRPAA VFLREVTALS G 

>RXA0 1841 -nucleotide sequence A: upstream 

TCGGGGGGCACCTTCCAGTTCCACTCACGAGAAGGCCAACCG 

CGGACTTCTCCGGCCCGTCGCTG 

>RXA0 1841 -nucleotide sequence B: coding region 

ATGGAGGAGGGTGAGGAGATCAGCCTGTCTGATTTTGAAGGCGAGGTCGTCGTCCTCAACGCTTGGGGCCAGTGGTG 

TGCACCGTGTCGGGCGGAAGTCGATGACCTGCAGCTTGTCCAGGAGACTCTCGACCCCCTCGGTGGCACGGTGCTGG 

GCATCAACGTCCGTGACTACAACCAGACCATCGCCCAGGACTTCAAACTCGACAACGCGGTGACCTATCCCTCGATC 

TACGACCCGCCGTTTCGTATCGCTGCGGCCCTGGGTGGGGTGCCGACCTCGGTCATCCCGACCACCATTGTCCTGGA 

CCGAAGCCACCGCCCGGCCGCGGTGTTCCTGAGGGAGGTCACCGCGTTGTCTGGT 

>RXA0 1841 -nucleotide sequence C: downstream 

TAGTCCTGGGAGGTGATGAGCTC 



BASF Aktiengesellschaft 990674 O.Z. 0050/50181 

Appendix A & B 

»RXA01837-amino acid sequence 
(1-393, translated) 131 residues 

SEGYYNDTVC HRITTSGIYV LQCGDPSSTG AGGPGFS FAN EYPTDEATDL TTFVIYERGT IAMANAGADT 
NGSQFFLNYE DSPLAPNYTY FGQITEEGLA TLDAIAEVGT EGGTGDGAPA QEVRIE SAAV A 
>RXA01 837 -nucleotide sequence B: coding region 

TCGGAGGGCTACTACAACGATACTGTCTGCCACCGCATCACCACCTCTGGCATTTACGTTCTCCAGTGCGGCGATCC 

AAGCAGCACCGGCGCAGGCGGCCCAGGGT TCAGC T TCGCCAACGAATACCCAACCGACGAAGCAAC TGACCTAACCA 

CCCCAGTCATCTACGAGCGCGGCACCATCGCCATGGCCAACGCTGGCGCTGACACCAACGGCTCCCAGTTCTTCCT 

AACTACGAGGATTCCCCACTGGCACCGAACTACACCTACTTCGGCCAGATCACCGAAGAAGGCCTTGCAACCCTCGA 

CGCCATCGCAGAAGTTGGCACTGAAGGTGGAACCGGCGACGGAGCA^ 

CTGTTGCG 

>RXA01837-nucleotide sequence C: downstream 
TAAGTTCTAAGCCCCTCTTCTTT 
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Appendix A & B 

»RXA02174-amino acid sequence 
(1-354, translated) 118 residues 

MEKPQIELPV GPAPEDLVIS DIIVGEGAEA RPGGEVEVHY VGVDFETGEE FDSSWDRGQT SQFPLNGLIA 
GWQEGIPGMK VGGRRQLTIP PEAAYGPEGS GHPLSGRTLV FIIDLISA 
>RXA02 17 4 -nucleotide sequence A: upstream 

CTTCGGCCGAAGCCACTTCGTCTGTCATAATGACAGGGATGGTTTCGGCCGTTTTTGCATGAAACCAAAAAATACGA 
TTTTCAAGGAGCATGTACAGCAC 

>RXA02174-nucleotide sequence B: coding region 

ATGGAAAAGCCACAGATTGAGCTACCGGTCGGTCCAGCACCGGAAGATCTCGTAATCTCTGACATCATCGTTGGCGA 

AGGAGCAGAAGCCCGCCCAGGTGGAGAAGTTGAGGTCCACTATGTGGGCGTTGACTTTGAAACCGGCGAGGAGTTTG 

ACTCTTCCTGGGATCGTGGACAGACCAGCCAGTTCCCACTCAACGGCCTCATTGCAGGTTGGCAAGAGGGAATTCCA 

GGCATGAAGGTCGGCGGACGTCGTCAGCTGACCATTCCGCCAGAGGCTGCTTACGGCCCTGAGGGTTCCGGCCACCC 

ACTGTCTGGCCGTACCCTGGTGTTCATCATCGATTTGATCAGCGCA 

>RXA02 17 4 -nucleotide sequence C: downstream 

TAATTTTCTTTACTGCGCTAAAC 
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Appendix A & B 



»RXA01072-amino acid sequence 
(1-231 , translated) 77 residues 

MAI TVYTKPA CVQCNATKKA IX) RAGLE YD L VDISLDEEAR EYVLALGYLQ APVWADGSH WSGFRPERIR 
EMATAAA 

>RXA0 107 2 -nucleotide sequence A: upstream 

GTGTCGCC T TGAGCGATGCAAAGAC TT TAAAAACAGATCTAAAAAAGTAAGATGAGGCATT AGCGTCAACCGC TAAA 
GCCGATAAGGGAAGGTCCAAAAA 

>RXA01 07 2 -nucleotide sequence B: coding region 

ATGGCAATGACCGTTTACACCAAGCCAGCTTGCGTCCAGTGCAATGCCACCAAGAAGGCCCTCGACCGCGCTGGTCT 

TGAGTATGACCTCGTTGATATCAGCCTTGATGAAGAGGCACGTGAGTACGTCCTCGCACTTGGCTACCTGGAGGCAC 

CAGTTGTCGTTGCAGATGGCTCCCACTGGTCCGGTTTCCGCCCAGAGCGCATCCGTGAAATGGGAACCGCAGCTGCC 

>RXA0 10 72 -nucleotide sequence C: downstream 

TAAACTGCACTTCGTGGCACGAC 



BASF Aktiengesellsclxaft 990674 O.Z. 0050/50181 

Appendix A & B 

»RXA00824-amino acid sequence 
(1-558, translated) 186 residues 

MTSSAKWSIV GWAIIAVIV AL I PQIiVGGE SAEEAQGETS TSKITTRPDC VASGAAGVDL PCLGGANGVG 
NELATWNLW AWWCEPCRAE LPIFDEFATT HPELNVIGVH ADQNAANGAA LLEDLGVNLA SYQDDSNLFA 
GTLGLPGWP ITIWSPDGN WDTFPQPFE TIDDLETAVA GALQNA 
>RXA00824-nucleotide sequence A: upstream 

T T TGGT T TGGAGGGGCCGTCAGATCCAT T TGAGGCGCAAAAAC TCATTAAAAGTGATGATAGGGAGCACC TGCTGAA 
AATGGCAGGAATGTAGAAAACAA 

>RXA00824-nucleotide sequence B: coding region 

ATGACAAGCAGTGCAAAGTGGTCCATCGTTGGAGTTGTCGCCATCCTGGCTGTGATCGTTGCGTTAATCCCGCAGCT 
TGTGGGTGGAGAAAGCGCAGAGGAAGCGCAGGGGGAGACGTCGACAAGCAAAATTACGACGCGTCCTGACTGCGTGG 
CCTCTGGCGCGGCGGGTGTGGATCTGCCCTGCTTGGGCGGCGCCAACGGCGTCGGCAACGAGCTGGCCACCGTGGTG 
AATCTGTGGGCGTGGTGGTGCGAACCGTGCCGCGCGGAGCTGCCGATTTTTGATGAATTCGCCACCACCCACCCCGA 
ACTCAACGTCATTGGCGTGCATGCAGACCAAAACGCAGCCAACGGCGCCGCACTCCTTGAGGATCTGGGCGTGAATC 
TTGCAAGCTACCAAGACGATTCCAACCTGTTCGCAGGCACCCTTGGGCTGCCGGGCGTCGTGCCGATCACCATCGTG 
GTTTCTCCAGACGGCAATGTAGTGGACACCTTCCCGCAGCCTTTCGAAACCATCGATGACCTCGAAACCGCTGTGGC 

AGGGGCGCTGCAGAATGCG 

>RXA00824-nucleotide sequence C: downstream 
TAACTACCCTGATTTACCGCATG 



BASF Aktiengesellschaft 990674 O.Z. 0050/50181 

Appendix A & B 

»RXA00568-amino acid sequence 
(1-1347, translated) 449 residues 

VKSSVEKLSD TRSKITVEVP FSELKPEIDQ AYAAIAQQVQ IPGFRKGKAP RQLIDARFGR GAVLEQWND 
MLPNRYAQAI EAEGIKAIGQ PNVEVTKIED NELVEFVAEV DVRPE FELPK FED I TVEVPA IKADEEAIEA 
ELETI*RARFS TLKDHNHKLK KGEFVTINIT AS IDGEKIEE ATTEGLSYEI GSDDLIDGLD KALIGAKKDE 
TVEFTSELAN GEHKGKEAQI SVEITATKQR ELPELDDEFA QIASEFDTIE ELRESTVSDV EAKQKNEQAA 
AIRDEVLAAA LGEADFALPQ SIVDEQAHSQ LHQLLGELAH DDAALNST.TiE AQGTTREEFD KKNVEDAEKA 
VRTQLFLDTL SEVEEPEVSQ QELTDHILFT AQSYGMDPNQ FIGQLQQSGQ IAKLFSDVRR GKALAQAICR 
VNVKDSEGNE IDPKEYFGEE EVAETESEA 
>RXA0 05 68 -nucleotide sequence A: upstream 

GTTTCGGACGACGCGAGAAATCGCATTAATCGTCGGAAACCGGGCTTGTTTTTGTAATATCTGAAACTTTCCCTTTC 
CCGATCATCCAGGAGATTTACTC 

>RXA00568-nucleotide sequence B: coding region 

GTGAAGAGTTCTGTCGAGAAGCTGAGCGACACCCGTTCAAAGATCACCGTTGAGGTTCCATTTTCTGAACTGAAGCC 
AGAGATCGACCAGGCATACGCCGCTCTAGCGCAGCAAGTfcCAGATCCCTGGTTTCCGTAAGGGCAAGGCACCGCGTC 
AGCTTATCGACGCACGCTTCGGCCGTGGTGCGGTTCTGGAGCAGGTTGTCAACGACATGCTTCCTAACCGCTACGCA 
CAGGCAAT C GAAGC T GAGGGCAT CAAGGCAAT C GGC CAGC C T AAC G T AGAGG T CAC CAAGAT C GAAGACAAC GAGC T 
CGTTGAGTTCGTCGCTGAGGTTGACGTTCGCCCAGAGTTCGAGCTTCCTAAGTTCGAGGACATCACTGTTGAGGTCC 
CAGCTATCAAGGCTGACGAAGAGGCAATCGAAGCAGAGCTCGAGACCCTGCGTGCACGTTTCTCCACCTTGAAGGAT 
CACAACCACAAGCTGAAGAAGGGTGAGTTCGTCACCATCAACATCACCGCAAGCATTGA 

GGCAACCACTGAGGGTCTGTCCTACGAAATCGGATCTGACGATCTGATTGACG^CCTGGACAAGGCTCTGATCGGCG 
C T AAGAAGGAT GAAACC GT AGAG T T CAC C T C T GAGC T GGCAAAC GGCGAGCACAAGGGCAAGGAAGC T CAAAT CAGC 
GTTGAGATCACCGCAACCAAGCAGCGCGAGCTGCCTGAGCTGGATGATGA 

CACCATCGAAGAGCTTCGTGAGTCCACCGTGTCTGACGTTGAGGCTAAGCAGAAGAACGAGC^ 

GCGACGAAGTTCTCGCTGCGGCTCTTGGCGAGGCTGACTTCGCTCTGCCACAGTCCATCGTTGACGAGCAGGCACAC 
TCCCAGCTGC^CCAGCTCCTCGGCGAGCTTGCACACGACGATGCTGCACTGAACTCCCTCCTTGAGGCTCAGGGCAC 
CACTCGTGAAGAGTTCGAGAAGAAGAACGTCGAAGATGCT 

TCTCTGAGGTTGAGGAGCCTGAGGTTTCCC^GC^GGAGCTCACCGACCACATCCTGTTCACCGCACAGTCTTACGGC 

ATGGACCCAAACCAGTTCATCGGTCAGCTGCAGCAGTCCGGCCAGATCGCGAACCTCTTCTCCGACGTTCGCCGTGG 

CAAGGCTCTTGCACAGGCTATCTGCCGCGTAAACGTGAAGGACTCCGAGGGTAACGAGATCGACCCTAAGGAATACT 

TCGGTGAAGAAGAAGTAGCTGAGACTGAGTCTGAAGCT 

>RXA0 05 6 8 -nucleotide sequence C: downstream 

TAAAAAC T T T AAAGAAAT AACGC 



Claims 

1 . An isolated nucleic acid molecule from Corynebacterium glutamicum encoding an 
SES protein, or a portion thereof. 

2 The isolated nucleic acid molecule of claim 1. wherein said nucleic acid molecule 
encodes an SES protein involved in the production of a fine chemical. 

3 An isolated Corynebactertum glutamicum nucleic acid molecule selected from the 
group consisting of those sequences set forth in Appendix A, or a portion tnereol. 

4. An isolated nucleic acid molecule which encodes a polypeptide sequence selected 
from the group consisting of those sequences set forth in Appendix B. 

5 An isolated nucleic acid molecule which encodes a naturally occurring allelic variant 
of a polypeptide selected from the group of amino acid sequences consisting of those 
sequences set forth in Appendix B. 

6 An isolated nucleic acid molecule comprising a nucleotide sequence which is at least 
50% homologous to a nucleotide sequence selected from the group consisting ot 
those sequences set forth in Appendix A, or a portion thereof. 

7 An isolated nucleic acid molecule comprising a fragment of at least 1 5 nucleotides 
of a nucleic acid comprising a nucleotide sequence selected from the group 
consisting of those sequences set forth in Appendix A. 

8. An isolated nucleic acid molecule which hybridizes to the nucleic acid molecule of 
any one of claims 1-7 under stringent conditions. 

9. An isolated nucleic acid molecule comprising the nucleic acid molecule of any one 
of claims 1-8 or a portion thereof and a nucleotide sequence encoding a heterologous 
polypeptide. 

10. A vector comprising the nucleic acid molecule of any one of claims 1-9. 

1 1 . The vector of claim 1 0. which is an expression vector. 

1 2. A host cell transfected with the expression vector of claim 1 1 . 

13. The host cell of claim 12, wherein said cell is a microorganism. 

14. The host cell of claim 13, wherein said cell belongs to the genus Coryne bacterium 

or Brevibacterium. 

1 5 The host cell of claim 12. wherein the expression of said nucleic acid molecule 
SnS modulation in production of a fine chemical from sa,d cell. 



16. The host cell of claim 15, wherein said fine chemical ,s selected ta^P 
consisting of: organic acids- proteinogenic and ^^Z^^^^ 
and pyrirnidine bases, nucleosides, nucleotides, lipids, saUira ed "^rattu^ 
fattyacids, diois, carbohydrates, aromatic compounds, vitamins, cofactors, and 

enzymes. 

17. A method of producing a polypeptide comprising culturing ^^^^j?^ ° f ^ 12 
in an appropriate culture medium to. thereby, produce the polypeptide. 

18. An isolated SES polypeptide from CorynebacteHum glutamicum, or a portion 
thereof. 

19. The polypeptide of claim 18, wherein said polypeptide is involved in the production 
of a fine chemical production. 

20. An isolated polypeptide comprising an amino acid sequence selected from the group 
consisting of those sequences set forth m Appendix B. 

2 1 An isolated polypeptide comprising a naturally occurring allelic variant of a 

' ^ amino acid sequence selected from the group consisting 

of those sequences setforth in Appendix B, or a portion thereof. 

22. The isolated polypeptide of any of claims 18-21, further comprising heterologous 
amino acid sequences. 

23 An isolated polypeptide which is encoded by a nucleic acid molecule comprising a 
n^cirotS sequent which is at least 50% homologous ,to a ^deie * cld seleCted 
from the group consisting of those sequences set forth m Appendix A. 

sequences set forth in Appendix B. 

25 A method for producing a fine chemical, comprising culturing a cell containing a 
veSor of claim 12 such that the fine chemical is produced. 

26 The method of claim 25. wherein said method further comprises the step of 
recovering the fine chemical from said culture. 

vector. 

28. The method of elate 25, wherein said cell belongs to the genus Corynebacurium or 

> Brevibacterium 

2*. T*. method d*» 25. wherein 



Corynebacterium acetophilum. Corynebacterium ammoniagenes. Corynebacterium 
fujiokense. Corynebacterium nitrilophilus. Brevibacterium ammoniagenes 
Brcyribacterium but am cum. Brevibacterium divaricatum. Brevibacterium flavum. 
Brevibacterium he alii, Brevibacterium ketoglutamicum. Brevibacterium 
ketosoreductum. Brevibacterium lactofermentum. Brevibacterium linens. 
Brevibacterium paraffinolyticum. and those strains set forth m Tabic .> 

30. The method of claim 25, wherein expression of the nucleic acid molecule from said 
vector results in modulation of production of said fine chemical. 

3 1 The method of claim 25, wherein said fine chemical is selected from the group 

consisting of: organic acids, proteinogenic and nonproteinogenic amino acids, purine 
and pyrimidine bases, nucleosides, nucleotides, lipids, saturated and unsaturated 
fatty acids, diols. carbohydrates, aromatic compounds, vitamins, cofactors. and 
enzymes. 

32. The method of claim 25. wherein said fine chemical is an amino acid. 

33 The method of claim 32. wherein said amino acid is drawn from the group consisting 
of: lysine, glutamate, glutamine. alanine, aspartate, glycine, serine, threonine, 
methionine, cysteine, valine, leucine, isoleucine. arginine. proline, histidine, 
tyrosine, phenylalanine, and tryptophan. 

34. A method for producing a fine chemical, comprising culturing a cell whose genomic 
DNA has been altered by the inclusion of a nucleic acid molecule of any one ot 
claims 1 -9. 
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